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@ The invention relates to hepatitis B virus antigens and 
hybrid antigens containing them and to the cloning of genes 
which code for said antigens in yeast by use of recombinant 
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Description 

Hepatitis B virus surface antigens and hybrid antigens containing them 



FIELD OF THE INVENTION 

5 The invention relates to hepatitis B virus antigens and hybrid antigens containing them and to the cloning of 
genes which code for said antigens in yeast by use of recombinant DNA techniques. 

BACKGROUND OF THE INVENTION 

10 A. Hepatitis B vaccines 

Infection with hepatitis B virus (HBV) is a serious, widespread health problem. Infection can be manifested in 
acute or chronic phases. In the United States, the number of cases of acute hepatitis is estimated to be at least 
100,000 per year with a fatality rate of 1 to 2 per cent and the prevalence of chronic carriers of HBV among 
healthy adults varies between 0.1 and 1% depending on age and social class.. In South America, the 

15 prevalence of chronic carriers is about 1 to 3%; in the U.S.S.R. and southern Europe, about 3 to 6<>/o; and in 
Asia and Africa, more than 100/o. 

In developed countries, there exists a need for a vaccine for people at high risk of exposure, such as 
patients and personnel in medical units where blood is handled, military personnel, spouses of chronic * 
carriers, travelers to areas of high HBV endemicity, newborns of chronic carriers, homosexuals, prostitutes 

20 and drug abusers. In third world countries, there exists a need for an inexpensive vaccine for mass 
immunization. Mass immunization programs may ultimately affect not only the incidence of acute hepatitis and 
the pool of chronic carriers but may also reduce the morbidity and mortality from chronic active hepatitis and * 
hepatocellular carcinoma. 

Dane particles, which are believed to be hepatitis B virions and which are isolatable from infected patients, 
25 have a diameter of about 42nm. Each consists of an envelope comprising the hepatitis B surface antigen 
(HBsAg), a capsid (HBcAg), an endogenous polymerase and a DNA genome. The genome is circular and 
double-stranded with a single strand region consisting of about 200 bases. The single strand region can be 
filled in, in vitro , by the action of the endogenous polymerase. The longer strand contains approximately 3,200 
bases. 

30 For a long time, it has been difficult to prepare HBV vaccines because it has proven difficult to propagate the 
virus in tissue culture and because the only known host is man. However, chimpanzees are also able to be 
infected in the laboratory with the virus. 

Valenzuela et al. (Nature, 298 , 347-350, 1982) report synthesis of HBsAg in yeast using an expression vector 
wherein the HBsAg coding sequence is an 835 base pair (bp) Taql -Hpal fragment and the promoter is the yeast 

35 alcohol dehydrogenase I promoter. Several earlier brief reports noted research preceding this reference. 
These are Valenzuela et al. (Arch. Biol. Med. Exp. (Chile), 14(1), 21-22, 1981) which reports expression in yeast 
of a DNA fragment containing a sequence which codes for a protein similar to HBsAg ligated to a yeast alcohol 
dehydrogenase promotor region ; a report in Scrip No. 61 6, p. 1 4 (Aug. 1 2, 1 981 ) , which states that a team of U. ~* S 

S. researchers including P. Valenzuela and W. J. Rutter have announced production in yeast of "the 

40 protein-coating surrounding hepatitis B virus"; and Zuckerman (Nature, 295, 98-99, 1982) which reports that 

W. J. Rutter has reported expression of glycosylated HBsAg in yeast cells. ~ 

Antigenic components of HBV, such as HBsAg, are reported to have been prepared in bacteria following 
insertion of a recombinant DNA molecule containing a gene which codes for the antigen. Burrell et al. (Nature, 
279 , Number 5708, 43-47, 1979) report expression in E. coli HB1 01 of HBV DNA sequences cloned in plasmid 

45 pBR322. 

Murray et al., European Patent Application Publication Number 13,828, 1980, disclose preparation of a 
recombinant vector which can code for HBV antigens, including HBsAg, in E. coli strain HB101. The vector is 
prepared from Dane particle DNA and plasmid pBR322. The authors state that useful hosts may include other 
bacterial hosts, yeasts and other fungi, animal or plant cells and other hosts, although the only host 
50 exemplified in the application is E. coli . 

Charnay, et al. (Nature, 286, 893-895, 1980) report construction of a bacteriophage carrying a fusion of the 
P-galactosidase gene and the HBsAg structural gene. The bacteriophage directs synthesis of a fusion protein 
comprising antigenic determinants of both HBsAg and (J-galactosidase. 

Tiollais et aL, UK Patent Application Number 2,034,323, disclose preparation of a coliphage containing HBV 
55 DNA. Fused phage-HBV DNA is transformed into E. coli strain C600. 

In UK Patent Application Number 2,070,621, a plasmid which comprises a part of the HBsAg gene and the 
promotor and the Z gene of the lactose operon and which can be cloned in E. coli is disclosed. 

Rutter et al., European Patent Application Publication Number 20,251, 1980, disclose recombinant vectors 
Including a recombinant vector comprising plasmid pBR322 and BamHI fragments of HBV DNA, which can be 
60 used to transform E. co|i. Another vector, comprising a BamHI fragment of HBV DNA and a portion of the 
tryptophan operon, was used to obtain expression in E. coli strain HB101, 

Edman et al. (Nature, 291, Number 5815, 503-506, 1981) describe construction of plasmids which direct - * 

synthesis of HBcAg and a p-Iactamase-HBsAg fusion protein, under the control of the tryptophan operon 
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regulatory region, in E. coll. 

Pumpen et a!. (Gene, 30, 201-210, 1984) disclose that small amounts of HBsAg monomer proteins and 
fusion proteins are synthesized in E. coll using antibodies raised against the denatured HBsAg monomer for 
detection. 

Other references disclosing insertion of HBV DNA into bacteria include Chamay. et ai. (Prog Med Virol 27 5 
88-92, (1981); MacKay et al. (Proc. Natl. Acad. Scl. U.S. 78, Number 7, 4610-4514, 1981); Fritsch et al (CR 
Acad. Scl., 287, Number 1 6, 1453,(1978) ; U. K. Patent Specification 2,034,323 (Derwent No. 46874C) and Pasek 
et al. (Nature, 282, No. 6, 575, 1979). 

HBV DNA has also been cloned in mammalian ceils. These include, human, mouse, and human hepatoma 
cell lines. For example, Dubois et al. (PrOc. Natl. Acad. Sci. U.S.., 77, Number 8, 4549-4553, 1980) report 10 
transformation of mouse cells with a ptasmld containing the HBV genome and expression of HBsAg- 
Hirschman et ai. (Proc. Natl. Acad. Scl. U.S., 77, Number 9, 5507-5511, 1980) report production of HBV-like 
particles by HeLa cells transformed with HBV DNA. 

Procedures for preparing HBV vaccine using HBsAg from human blood are reported by Funakoshi et aJ 
(Prog. Med. Virol., 27, 163-167, 1981) and Maupas et al. (Prog. Med. Virol., 27, 185-201, 1981). The vaccine 75 
prepared by Funakoshi et al. contains 40 Jig of purifted, formalin-treated HBsAg, phosphate, sodium chloride, 
20 mg of mannitol; and 0.1 0/0 of aluminum hydroxide as adjuvant. In the latter paper, Maupas et al. rfeport that 
one dose of vaccine was 1 mL of purified, formalin-treated HBsAg containing 2-10 wa/mL of protein (Lowry's 
method) and O.io/o of aluminum hydroxide. The protocol used in the study reported by Maupas et al. called for 
three injections at one month intervals with a booster after one year; the authors propose a protocol 20 
consisting of two Injections of concentrated HBsAg at three month Intervals. 

Additional references to preparation of HBV vaccines include Maupas et al., Adamowicz et al and 
Funakoshi et al. at pages 3, 37 and 57, respectively, of Hepatitis B Vaccine INSERM Symposium No. 18, edit, by 
Maupas and Guesry, 1981, Elsevier/North-Holland Biomedical Press. 

Yeasts have been used as host organisms for expression of certain other non-HBV DNA sequences. For 25 
example, Fraser et al., U. K. Patent Application 2,068,969, disclose preparation of chicken ovalbumin in yeast; 
Scrip No. 640, p. 11 (Nov. 4, 1981) contains a report that a type of Interferon is being prepared In yeast In 
European Patent 1 1,562 (Derwent No. 38762C) are reported hybrid yeast plasmlds containing the ura + yeast 
gene in the 2 \l plasmid. — 3 

The authentic Hepatitis B Virus surface antigen (HBsAg) can be recovered from plasma of Infected 30 
individuals as a particle of about 22 nm comprised of two proteins known as P24, and Its glycosylated 
derivative GP28, both of which are encoded by the 226 amino acid coding sequence on the HBV genome 
known as the S protein coding sequence. The entire 163 amino acid coding sequence which Immediately 
precedes the S protein coding sequence on the HBV genome is referred to herein as the Pre S coding 
sequence. The 55 amino acids of the Pre S coding sequence which immediately precede the S protein coding 35 
sequence are referred to herein as the Pre S2 coding sequence, and the remaining 108 amino acids of the Pre 
S coding sequence are referred to herein as the Pre S1 coding sequence. The Pre S coding sequence, or any 
smaller portion thereof, is also referred to herein as the HBsAg precursor protein. 

For completeness, It should be noted that the entire pre S coding sequence for some HBV subtypes (e.g., 
ayw) is 163 codons, while the entire Pre S coding sequence for other HBV subtypes (e.g., adw 2 ) Is 174 codons! 40 
In either case, the Pre S2 region is the 55 codons Immediately preceding the S protein coding sequence. 

The Pre S2 coding sequence codes for the polyalbumln binding or receptor site found on the surface of 
Dane . particles and on the surface of some HBsAg particles Isolated from sera of HBV-infected patterits. 
Although the Pre S2 region Immediately precedes the S protein coding region on the HBV genome, the Pre S2 
region is not involved in the assembly of HBsAg particles. See, e.g., Persing et al. (Proc, Natl. Acad. Scl. U.S A. 45 
82, 3440-3444, 1985). 

Valenzuela et al. (Nature, 298, 347-350, 1982) disclose that particles resembling HBsAg were found after 
disruption of yeast cells transformed with a vector containing the S protein coding sequence, and concluded 
that HBsAg particles are synthesized In yeast. 

Harford et al. (Developments in Biological Standardization, 54, 125-139, 1983) disclose that particles 50 
resembling HBsAg were found after disruption of yeast cells transformed with a vector containihg 18 amino- 
acids of the coding sequence of ornithine carbamoyl transferase Immediately preceding the N-termirlal 42 
amino acids of the Pre S2 coding sequence immediately preceding the S protein coding sequence 

Laub et al. (J. Virology, 48(1), 271-280, 1983) disclose construction of a Simian Virus 40 early replacement 
vector that has a large portion of HBV genome DNA Including the Pre S-S protein coding sequence, and 55 
expression of such portion by SV40- transformed CV-1 cells (COS cells) transformed by such vector 

Stibbe et al. (J. Virology, 46(2), 626-628, 1983) disclose that minor HBsAg glycoproteins, named QP33 and 
GP36, are coded for by the Pre S2-S protein coding sequence. Stibbe et al. (Dev. Biol. Stand., 54, 33-43, 1983) 
disclose that HBsAg particles containing large amounts of GP33 and GP36 did not Induce higher antIS protein 
antibody titers than those HBsAg particles almost devoid of GP33 and GP36. so 

Heerman et al. (J. Virol., 52 (2), 396-402, 1984) disclose that the entire 389 amino acid coding sequence 
comprising both the S protein coding sequence and the Pre S coding sequence codes for a polypeptide, P39, 
which is found In HBV particles and viral surface antigen filaments along with Its glycosylated form, GP42 and 
the other HBV surface antigen associated polypeptides P24, GP27, GP33 and GP36. 

Neurath et al. (Science, 224, 392-394, 1984) disclose that polypeptides having the sequence of the 26 65 
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amino-terminal amino acids of the Pre-S2 coding sequence act as an immunogen, and suggest that antibodies 
elicited by the immunogen can be utilized for diagnostic tests. 

Michel et ah (Proc. Natl. Acad. Sci. U.S.A., 81 , 7708-7712, 1984) disclose synthesis of HBsAg carrying human 
serum polyalbumin receptors in Chinese hamster ovary (CHO) cells transfected with a piasmid carrying the 
5 Pre S2-S protein coding sequence. 

Persing et al. (Proc. Natl. Acad. Sci. U.SA, 82, 3440-3444, 1985) disclose that mouse L cells transformed 
with a Pre S2-S protein coding sequence produce three HBsAg related polypeptides (24,000, 27,000 and 
35,000 daltons) all of which may be organized into complex immunoreactive HBsAg particles of 22 nm 
diameter which bind to polymerized human serum albumin (HSA); while mouse L celis transformed with the 

10 Pre S2-S protein coding sequence bearing a frame-shift mutation near the 3' end of the Pre S2 region produce 
only the 24,000 and 27,000 dalton polypeptides organized into 22 nm diameter immunoreactive HBsAg 
particles which are unable to bind HSA. Persing et ai. conclude that the Pre S2-S protein coding sequence 
encodes the 35,000 dalton species, that the Pre S2 protein accounts for the HSA binding activity of HBsAg but 
is not required for assembly and secretion of the HBsAg particles; and that the major polypeptide of HBsAg 

15 (i.e., the 24,000 dalton species) is not derived primarily by cleavage of larger precursors (i.e., the 27,000 and 
35,000 dalton species) encoded by the Pre S2-S protein coding sequence. 

Milich et ai. (Science, 228, 1 195-1 199, (1985) disclose that vaccines that contain HBV particles with both Pre 
S2 and HBsAg, prepared from Chinese Hamster Ovary (CHO) cells transfected with a piasmid containing the 
Pre S2-S protein coding sequence, can circumvent nonresponsiveness In certain mice to vaccines which just 

20 contain only HBsAg; and that the 26 amino acid residues at the NH 2 -terminus of the 33,000 dalton polypeptide 
coded for by the Pre S2-S protein coding sequence represent a dominant antibody binding site on the Pre S2 
region. 

Michel et al., "Vaccines 86", Brown et al., Ed., Cold Spring Harbor Laboratory (1986), 356-363, discuss the 
synthesis in CHO ceils of Hepatitis B surface antigen particles containing the Pre S2 region expression 
25 product. 

Neurath et al. (Nature, 315, 154-156, 1985) disclose that the Pre S region codes for proteins on the HBV 
envelope with domains specifically recognized by liver cells; the Pre S2-S protein coding sequence codes for a 
protein present in HBV particles; synthetic peptides corresponding to the gene encoding Pre-S proteins are 
immunogenic; and conclude that HBV vaccines should contain Pre S determinants. 

30 Valenzuela et al. (Biotechnology, 3, 317-320, 1985) disclose the expression of the entire pre S2-S protein 
coding sequence in yeast transformed with such coding sequence; and also disclose that such yeast cells do 
synthesize a particle containing both HBsAg and Pre S2 which is very similar In electronmicroscopy and 
sedimentation properties to particles containing only HBsAg, and that the Pre-S2 region does not infer the 
ability to form particles resembling the 22 nm HBsAg particles. Valenzuela et al. also disclose that it has been 

35 hypothesized that HBV gets into the liver by binding polyalbumin to its polyalbumin receptor which in turn 
binds the polyalbumin receptor to the liver cell and thus, the virus gets internalized, and that the polyalbumin 
receptor of the HBV is encoded by the Pre S2 region. Valenzuela concludes that a vaccine containing Pre S2 
will elicit antibodies which, in addition to inactivating HBV through the normal mechanism, might interfere with 
the way the virus enters liver cells. 

40 Takeda Chemical Ind. KK, European Patent Application Publication No. 171,908-A, claims the production of 
non-glycosylated hepatitis-B virus surface antigen P31 protein in yeast. 

Chiron Corporation, European Patent Application Publication No. 0,174,444 A2, claims a method for 
preparing HBsAg particles containing P31 protein in yeast. 
The following Table I compares known Pre S2 region amino acid sequences with the HBV pre S2 amino acid 

45 sequence of the subject invention : 



50 



55 



60 
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Table I Comparison of amino acid sequence of pre $2 
region from different serotypes of HBV 



HBV 

sub- 

tyjge 



Pre S2 coding region 



-55 -50 



-40 



-30 



-20 



-10 



Reference 



adw MQWNSTAFHQALQOPRVRGLYFPAGGSSSGTVNPAPNIASHISSSSARTGDPVTN 



adw 
adw 
adw 
adw 
adr 
adr 
ayw 
adyw 



T L 
T L 
T T 
T T 



L 
L 



V TT 

V TT 
VLTT 



P 
P 

PL 



V TTT P 



I 
I 
I 
I 

IFS 
I S 
IFS I 
IFS I 



AP 
AP 
AL 
AL 



The subject 
Pre 52 cod- 
ing region 
1 
2 
3 
4 
5 
6 
7 
8 



Amino Acid Abbreviation Kev 



Asp 


D 


Aspartic acid 


He 


I 


, Isoleucine 


Thr 


T 


Threonine 


Leu 


L 


Leucine 


Ser 


S 


Serine 


Tyr 


Y 


Tyrosine ' 


Glu 


E 


Glutamic acid 


Phe 


F 


Phenylalan 


Pro 


P 


Proline 


His 


H 


. Histidine 


Gly 


G 


Glycine 


Lys 


K 


Lysine 


Ala 


A 


Alanine 


Arg 


R 


Arginine 


Cys 


C 


Cysteine 


Trp 


W 


Tryptophan 


Val 


V 


Valine 


Gin 


Q 


Glutamine 


Met 


M 


Methionine 


Asn 


N 


Asparagine 
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B. Malaria Vaccines 

Numerous recent studies suggest that protective immunity in a sensitive host to infection by the sporozoite 
stage of a particular species of Plasmodium can be mediated via antibodies produced by such host to the 
15 circumsporozoite protein (CS protein) of the particular sporozoite. 

The sporozoite proteins of some types of Plasmodium have been cloned and some of those have been 
expressed by recombinant DNA techniques. 

Nussenzweig et al. (U.S. Patent 4,466,917) disclose a sporozoite polypeptide, identified as the P-44 protein, 
and its cloning and expression in E. coii. 
20 Sharma et al. (Science, 228, 879-882, 1985) disclose the expression of a portion of the circumsporozoite 
(CS) protein of Plasmodium knowlesi in yeast by employing an expression vector containing the 5' regulatory 
region of the yeast alcohol dehydrogenase I gene in place of the 5' upstream region of the P. knowlesi CS gene 
sequence. 

New York University, PCT application Publication Number WO84-02922-A, disclose cloning of a portion of 
25 the coding region for the P. knowlesi circumsporozoite (CS) protein repeat unit and expression of 
beta-lactamase and beta-galactosidase fusions thereof in E. coii . 

Kemp et al., PCT Patent Application Publication Number WO84-02917-A, disclose cloning and expression in 
E. coii of a coding sequence for the CS protein derived from the blood stage of P. falciparum . 

Dame et al. (Science, 225 , 593, 1984) report cloning and expression of the CS protein of P. falciparum in E. 
30 coii . The protein is described as comprising about 412 amino acids with an approximate molecular weight of 
44,000. it comprises 41 tandem repeats of a tetrapeptide. Synthetic 7-, 11- and 15- residue peptides derived 
from the repeat region bound to monoclonal antibodies raised against the CS protein. 

Ellis et al. (Nature, 302, 536-538, 1983) report expression of a beta-lactamase - P. knowlesi CS protein fusion 
in E. coii. 

35 Enea et al. ( Proc. Natl. Acad. ScL, USA , 81, 7520-7524, 1984) report an analogous repeat unit structure 
within the CS protein of F\ cynomolgi , and the cloning and expression of the CS protein in E. coii as well as a 
disclosure of its coding sequence. 

Bailou et al. (Science, 228, 996-999, 1985) disclose that synthetic peptides of the repeating region of the CS 
protein of Plasmodium falciparum conjugated to bovine serum albumin (BSA) or to thyrogiobulin produced 
40 antibody responses in mice and rabbits, and conclude that a vaccine against the sporozoite stage of the 
malaria parasite can be developed by using synthe tic peptides of the repeating region of the CS protein 
conjugated to a carrier protein. 

Weber et al. (Molecular and Bacterial Parasitology, 15, 305-316, 1985) disclose the use of a cloned CS 
protein gene of a Brazilian strain of P. falciparum as a probe to analyze the structure of 17 other P. falciparum 
45 strains by nucleic acid hybridization, and conclude that the CS protein gene is highly conserved and that 
malaria vaccine development with the CS protein is unlikely to be complicated by strain variation. 

Arnot et al. (Science, 230, 815-818, 1985) disciose the cloning of the CS protein of P. vivax and disclose its 
entire coding sequence and conclude that the coding sequence of the CS protein of P. vivax is homologous to 
that of the CS gene of P. knowlesi but not P. falciparum . 
50 Sharma et al. (Science, 229, 779-782, 1985) disciose the complete nucleotide sequence of the coding region 
of the CS gene of the Nuri strain of P. knowlesi . 

Young et al. (Science, 228, 958-962, 1985) disclose the expression of P. falciparum CS proteins In E. coll for 
potential use in a human malaria vaccine. 
The Walter and Eliza Hall Institute of Medical Research, PCT Patent Application Publication No. 
55 WO84/02917, 1984. disclose artificially constructed polynucleotide sequences substantially corresponding to 
all or a portion of P. falciparum mRNA or genomic DNA; the peptide corresponding to such sequence and a 
method for production thereof; a composition for stimulating immune responses against P. falciparum 
antigens in a mammal comprising such peptide. 
Mazier et al. (Science, 231, 156-159, 1986) disclose that antibodies raised in mice immunized with several 
GO recombinant and synthetic peptides of the circumsporozoite protein of P. falciparum were evaluated for 
protective activity In a human hepatocyte culture system and were found to exhibit a protective effect against 
the parasite at three points: sporozoite attachment to the hepatocyte surface, entry and subsequent 
intracellular development. 

65 
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C. Polyvalent vaccines 

Valenzuela et al. (Biotechnology, 3, 323-327, 1985) disclose the use of the HBsAg polyaibumin receptor 
» coded for by the Pre S2 coding sequence as a means to prepare polyvalent vaccines. Valenzuela et al. also 

disclose preparation of a hybrid HBsAg-Herpes simplex 1 virus glycoprotein D (HSV-lgD) particle by 
expressing a hybrid HSV-lgD-Pre S2-S protein coding sequence in yeast. 5 

Valenzuela, "Hepatitis B subunit vaccines using recombinant DNA Techniques," May 15, 1985 
Bio-Expo-5-Meetlng, Boston Massachussetts, discloses that with a hybrid such as the HBsAg-HSV-lgD 
particle described above, there is a problem of immunodomlnance of the HBsAg over the foreign Immunogen 
presented on its surface. 

Chiron Corporation, European Patent Application Publication No. 0,175,261 claims a novel hybrid particle w 
containing at least a major portion of the Hepatitis B surface antigen fused to one or more oligopeptides, 
where the hybrid polypeptide is capable of forming a particle in a cellular host with presentation of at least one 
epitope of at least one oligopeptide. 

Proteins can undergo post-translational modifications and some of these can profoundly affect the protein's 
conformation and function. The presence of oligosaccharide chains on human derived preS1-preS2-S and is 
preS2-S protein (Heerman et al. In J. Virol., 52, 396-402, 1984; Stibbe and Gerlich in Virology, 158, 436- 442, 
1982; Stibbe and Gerlich in J. Virol., 46, 626-628, 1983) and presumably of myristic acid on the preS1- preS2-S 
protein (Parsing et al., Abstracts of papers presented at the 1986 meeting on Molecular Biology of Hepatitis B 
viruses, August 28-31, 1986, Cold Spring Harbor Laboratory) might Influence particle formation, conformation 
, of the proteins in the particle and the antigenicity and immunogenlcity of these particles. 20 

Yeast ( Saccharomyces cerevisiae) has the enzymaticcapability for protein glycosylation (Bailou, The 
Molecular Biology of the Yeast Saccharomyces , Metabolism and Gene Expression"; Strathem, Jones and 
Broach Ed., Cold Spring Harbor Laboratory (1982), p. 335-360) and fatty acid acylation Howler and Glasler In 
c Proc. Natl. Acad. ScL, 83, 2812-2816, 1986) similar to those of higher eukaryotlc cells. 

Viral surface glycoproteins expressed in yeast have been found to be glycosylated. Jabbar et al. (Proc. Natl. 25 
Acad. ScL, 82, 2019-2023, 1985) disclose that expression of the influenza viral hemagglutinin gene in yeast 
resulted in a glycosylated hemagglutinin protein. Wen and Schleslnger (Proc. Natl. Acad. ScL, 83, 3639-3643, 
1986) disclose that expression of Sindbls and Vesicular stomatitis virus glycoproteins In yeast gave 
glycosylated forms of these viral proteins. Fujisawa et al. (Abstracts of papers presented at the 1986 meeting 
on Molecular Biology of Hepatitis B viruses, August 28-31, 1986, Cold Spring Harbor Laboratory, page 62) so 
disclose that expression of the preS2-S gene in yeast results in the synthesis of two glycosylated forms of the 
preS2-S protein. Kniskern et al. (Abstracts of papers presented at the 1986 meeting on Modern Approaches to 
New Vaccines, Including Prevention of AIDS, September 9-14, 1986, Cold Spring Harbor Labo ratory, page 89) 
disclose that expression of the preS1-preS2-S gene In yeast results in the synthesis of two preS1-preS2-S 
proteins of 45 kD and 39 kD. Persing et al. (Abstracts of papers presented at the 1986 meeting on Molecular 35 
Biology of Hepatitis B viruses, August 28-31, 1986, Cold Spring Harbor Laboratory, page 19> disclose that 
labeling of COS7 cells expressing preS1-preS2-S, preS2-S and S proteins with 8 Hmyristic acid results in the 
presence of label in the preS1-preS2-S protein. 

Fusion proteins expressed In yeast and reassembled in a particle are purified by the general scheme: 
t Aerosil adsorption and desorption (1) — ► CaCl2 precipitation (2) — ► phenylagarose or phenylboronateaga- 40 

rose adsorption and desorption (3) CsCI gradient centrffugation or DEAE-ion exchange chromatography. 

Step 1 is described in EP application publication 0 204 680 and step 3 is described In EP application 
publication 0 199 698 as far as phenylagarose is concerned. The specific combination of step (1 ) and (3) is able 
* to perform a drastic clean-up of contaminating material (elimination of 9Qo/o protein, carbohydrates and 50% 

lipids). This novel combination allows the good functioning of the next chromatographic steps. 45 

In case that fusion proteins are glycosylated, phenylboronate (PBA) is used as an affinity adsorbent. At pH 
9.0 the boronate group makes covalent complexes with potential cis-diol groups on the glycosylated 
sidechains. PBA gels have been used occasionally for the purification of soluble glycoproteins but not for the 
purification of particles containing glycoproteins embedded in the lipid bilayer (See, Cook, et al., Analytical 
Biochemistry, 149. 349-353, 1985; Middle, et al., Biochemical Journal, 209, 771-779, 1983; Cook, et ai., 50 
Biochimica et Biophysics Acta, 828, 205-212, 1985; Maestasane, et al., Journal of Chromatography, 189 , 
225-231 , 1980). Because of the multiple ligand binding sites on the particle, PBA gels with very low PB4lgand 
density have to be used, e.g., PBA gels with boronate content < 10jtg/ml gel, preferably 5jig boronate/ml gel. 
The use of phenylboronate gels with very low boronate content allow affinity chromatography of particles 
containing embedded glycoproteins. 55 

SUMMARY OF THE INVENTION 

This Invention relates to a recombinant DNA molecule comprising a functional DNA coding sequence fused, 
in phase, to a portion of the Pre S2 region of a Hepatitis B virus (HBV) Pre S2-S protein coding sequence. The 
term "coding sequence" or "coding region" as used herein also encompasses any functional derivative 60 
thereof. By the term "functional derivative" is meant a coding sequence with amino acid alterations which, 
where appropriate, don't interfere with particle formation and/or which retain Immunogenlcity where such 
retention is desirable. Such functional derivative can be prepared by conventional site specific mutagenesis 
% such as by the methods disclosed by Botstein et al. ( Science , 229 , 1193-1201, 1985. Preferably such DNA 

molecule also comprises a regulatory region which is preferably derived from the yeast arg 3 gene. 65 
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This invention also relates to a recombinant vector which comprises a recombinant DNA molecule 
operatively linked to a regulatory region, wherein the DNA molecule contains a functional DNA coding 
sequence fused, in phase, to a portion of the Pre S2 region of a HBV pre S2-S protein coding sequence. By the 
term "regulatory region" as used herein is meant a DNA sequence which contains a promotor region and other 
5 sequences necessary for transcription and translation of a coding sequence. 

This Invention also relates to a eukaryotic host cell transformed with a recombinant vector, wherein said 
vector comprises a DNA molecule operatively linked to a regulatory region, and wherein said DNA molecule 
contains a functional DNA coding sequence fused, in phase, to a portion of the pre S2 region of a HBV Pre 
S2-S protein coding sequence. This invention also relates to a method of preparing such transformed host cell 
10 which comprises transforming a eukaryotic host cell with such recombinant vector. 

This invention also relates to a method for preparing a hybrid particle containing HBsAg protein which 
contains and/or presents a peptide encoded by afunctional DNA coding sequence, which comprises cuituring 
a eukaryotic host cell transformed with a recombinant vector in appropriate culture, media and isolating the 
particle form a cell lysate or extract of such host's culture, wherein said vector comprises a DNA molecule 
15 operatively linked to a regulatory region, wherein the DNA molecule contains the functional DNA coding 
sequence fused; in phase, to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence. 

This invention also relatives to a hybrid immunogenic particle comprising a Hepatitis B virus surface antigen 
(HBsAg) particle which contains and/or presents a peptide encoded by a functional DNA coding sequence. 
This invention also relates to a vaccine comprising an immunoprotective amount of such hybrid particles. 
20 This invention also relates to micelle comprising such hybrid particle and poiysorbate and to a vaccine 
comprising an immunoprotective amount of such micelles. 

By the term 'functional DNA coding sequence" is meant a DNA coding sequence which, when fused, in 
phase, to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence, is not involved with, and 
does not interfere with, the assembly of the HBsAg particle. 
25 Preferred functional DNA coding sequences include, but are not limited to, (a) the entire HBV Pre S protein 
coding sequence, or an immunogenic derivative thereof, such as, but not limited to, the Pre S2, Pre S1 or Pre 
S1-Pre S2 protein coding sequence; and (b) other immunogenic coding sequences such as the coding 
sequence of the circumsporozoite protein of Plasmodium , or any immunogenic derivative thereof, the 
envelope gene coding sequence of HIV or any immunogenic derivative thereof, especially the C7 peptide, 
30 peptide 121 or Dreesman peptide coding sequence or an immunogenic derivative thereof. 

This invention also relates to a HBV Pre S1 protein coding region which codes for the following amino acid 
sequence: 



35 



40 



45 



50 



55 



60 



65 



-163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro lie Lys Asp His Trp 
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CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asa Gin Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly He Leu Gly Trp Ser Pro Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly He Leu Thr Thr Val Ser Thr He Pro 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 



-20 

Gly-Thr-Val-Asn-Pro~Ala-Pro-Asn-Ile-Ala-Ser-His-Ile--Ser- 
-10 

Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn; 



10 



15 



20 



25 



30 



35 



or to a HBV Pre S2 protein coding region or to a HBV Pre S2-S protein coding region wherein the Pre S2 
portion thereof codes for the following amino acid sequence: 

-55 -50 40 

Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-Leu-Gln-Asp- 

Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser- 45 
Ser- 



50 



55 



a recombinant DNA vector comprising such Pre S1, Pre S2 or Pre S2-S protein coding region operatlveiy 
linked to a regulatory region; a transformed host ceil containing such vector; a process for preparing such 
transformed cell which comprises transforming a host cell with such vector; the protein coded for by such 
coding region; a process for preparing such protein; a vaccine comprising an immunoprotective amount of 
such protein, the particle code for by such Pre S2-S protein coding region; a process for preparing such en 
particle, a vaccine comprising an immunoprotective amount of such particle, and a micelle comprising such 
particle and polysorbate, and a vaccine comprising an Immunoprotective amount of such micelles. 

This invention also relates to a recombinant DNA vector comprising an entire Pre S1-Pre S2-S protein 
coding sequence or PreS1 -functional DNA coding sequence-PreS2-S protein coding sequence, or 
Immunogenic derivative thereof, operatively linked to a regulatory region ; to a yeast host cell transformed with & 
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such vector; to a method of preparing such transformed host which comprises transforming a yeast host cell 
with such vector; to a method of preparing protein encoded by the Pre S1-Pre S2-S protein coding sequence, 
or Pre-S1 -functional DNA coding sequence - Pre S2-S protein coding sequence, or an immunogenic derivative 
thereof; to the protein prepared by such method; to a vaccine comprising an immunoprotective amount of 
5 such protein ; to a particle containing polypeptide encoded by the Pre S1 -Pre S2-S protein coding sequence or 
Pre S1 -functional DNA coding sequence - Pre S2-S protein coding sequence or an immunogenic derivative 
thereof; to a method of preparation thereof, a vaccine comprising an immunoprotective amount of such 
particles, a micelle comprising such particle and polysorbate, and a vaccine comprising an immunoprotective 
amount of such Micelles. 

10 This invention also relates to a Pre S1-Pre S2 protein coding sequence or Pre S1-Pre S2-S protein coding 
sequence of which the Pre S1-Pre S2 portion comprises the following amino acid sequence: 

PRE-S1 REGION 
15 - 163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 

20 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

25 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

so 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro lie Lys Asp His Trp 

35 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

40 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

45 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly lie Leu Gly Trp Ser Pro Gin Ala Gin 

so 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly lie Leu Thr Thr Val Ser Thr He Pro 

55 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

60 



65 

10 



AGG CAG CCT ACT CCC 
Arg Gin Pro Thr Pro 

AGA GAC AGT CAT CCT 
Arg Asp Ser His Pro 

PRE-S2 REGION 
-55 

ATG CAG TGG AAT TCC 
Met Gin Trp Asn Ser 

GCT CTG CAG GAT CCC 
Ala Leu Gin Asp Pro 

TAT TTT CCT GCT GGT 
Tyr Phe Pro Ala Gly 

ACA GTA AAC CCT GCT 
Thr Val Asn Pro Ala 
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ATC TCT CCA CCT CTA 
lie Ser Pro Pro Leu 

CAG GCC 
Gin Ala 



ACT GCC TTC CAC CAA 
Thr Ala Phe His Gin 

AGA GTC AGG GGT CTG 
Arg Val Arg Gly Leu 

GGC TCC AGT TCA GGA 
Gly Ser Ser Ser Gly 

CCG AAT ATT GCC TCT 
Pro Asn He Ala Ser 



CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG 
His He Ser Ser Ser Ser Ala Arg Thr Gly 



GAC CCT GTG ACG AAC 
Asp Pro Val Thr Asn 



This invention also relates to a recombinant vector comprising such novel Pre S1-Pre S2 or Pre S1-Pre S2-S 
protein coding sequence operatively linked to an expression control sequence; a host cell transformed wtth 
such vector; a method of preparing such transformed host which comprises transforming a host cell wfth such 
vector; to a method of preparing protein encoded by such Pre S1-Pre S2 or Pre 31 -Pre S2-S protein coding 
sequence, or an immunogenic derivative thereof; to the protein prepared by such method; to a vaccine 
comprising an immunoprotectlve amount of such protein; to a particle containing polypeptide encoded by 
such Pre S1-Pre S2-S protein coding sequence or an immunogenic derivative thereof; to a method of 
preparation thereof, a vaccine comprising an Immunoprotectlve amount of such particles, and a micelle 
comprising such particle and poiysorbate. 

The following represents the key to the amino acid abbreviations used in this application* 
Asp Aspartic acid 
Thr Threonine 
Ser Serine 
Glu Glutamic acid 
Pro Proline 
Gly Glycine 
Ala Alanine 
Cys Cysteine 
Val Valine 
Met Methionine 
lie Isoleucine 
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Leu Leucine 
Tyr Tyrosine 
Phe Phenylalanine 
His Histidine 
5 Lys Lysine 
Arg Arginine 
Tip Tryptophan 
Gin Gtutamine 
Asn Asparagine 

10 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a restriction endonuclease cleavage map of pRIT10601 . 

Figure 2 is a restriction endonuclease cleavage map of pRIT10616. 

Figure 3 is a restriction endonuclease cleavage map of pMC200. 
15 Figure 4 is the nucleotide sequence of a portion of the 3300 bp] Hindlli yeast DNA insert in pMC200 

which portion contains Hindi, Bglll and EcoRI sites. 

Figure 5 is a flow sheet illustrating preparation of pRIT10671 and pRIT10673. 

Figure 6 is aflow sheet illustrating preparation 35 of pRIT10749. 

Figure 7 is aflow sheet illustrating preparation of pRIT10761 and pRIT10764. 
20 Figure 8 is a flow sheet illustrating preparation of pRIT10167 (Example 1 ) t pRIT10158 and pRIT10162 

(Example 3); pRIT10158 t pRIT10677, and pRIT10909 (Example 4); pRIT10911 (Example 5); pRIT10679, 
pRIT10903, pRIT10914 (Example 6); pRIT122H, pRIT12209 and pRIT12230 (Example 7); and pRIT12288 
and pRIT12322 (Example 8). 

Figure 9 is a restriction endonuclease cleavage map of pRIT10172. 
25 Figure 10 is a restriction endonuclease cleavage map of pRIT1 2290. 

Figure 11 is a restriction endonuclease cleavage map of pRIT12322 which illustrates which portions 
thereof were derived from pRIT12230 and pRIT12288. 

Figure 12 is aflow sheet illustrating preparation of pRIT12571 ; pRIT12573; pRIT12572 and pRIT12574. 

Figure 13 is a restriction endonuclease map of pRIT12309. 
30 Figure 14 is aflow sheet illustrating preparation of pRIT12314 and pRIT12544. 

Figure 15 is aflow sheet illustrating preparation of pRIT12658. 

Figure 16 is a restriction endonuclease cleavage map of pRIT12662. 

Figure 17 is a flow sheet illustrating preparation of pRIT12660. 

Figure 18 is aflow sheet illustrating preparation of pRIT12845 (a,b,c,d,e,f). 
35 Figure 1 A is a flow sheet illustrating the preparation of pRIT12662. 

Figure 2A shows a schematic restriction map and sequencing strategy of clone XMPf I. 

Figure 3A shows the nucleotide sequence of the circumsporozoite protein gene from P. falciparum . 

Figure 4A shows the PreS1-PreS2 DNA coding sequence comprised by pRIT12792. 

Figure 1 B is a schematic restriction map of the HIV B virus envelope region. 
40 Figure 2B illustrates the construction of plasmid pRIT1 2893. 

Figure 3B illustrates the construction of plasmid pRIT12897. 

Figure 4B illustrates the construction of plasmid pRIT12899. 

Figure 5B illustrates the construction of plasmid pRITX. 

45 DETAILED DESCRIPTION OF THE INVENTION 

EXPRESSION OF HBsAg IN YEAST 
By "regulatory region" as used herein is meant a DNA sequence which contains sequences necessary or 

desirable for transection and translation of a coding sequence. Such regulatory regions are generally placed 
50 5' and adjacent to the coding sequence which is to be expressed. Preferably a further region of yeast DNA 

containing a signal for the termination of transcription is placed immediately 3' to the coding region to be 

expressed so as to effect termination of the transcript. 
By "HBsAg" as used herein is meant an antigen containing proteins encoded by the S protein coding 

sequence of the HBV genome, wherein such HBsAg is structurally similar to authentic HBsAg or has 
55 substantially the same S protein coding sequence encoded antigenic determinants as authentic HBsAg, that 

is, is capable of stimulating an immune response which speci fically recognize and react with authentic HBsAg 

or is specifically recognized by anti-HBsAg antibodies. 
The HBsAg coding sequence can be isolated from DNA extracted from Dane particles in infected human 

serum by filling in the single strand region with a DNA polymerase, preferably the endogenous polymerase, 
60 followed by digestion with a restriction endonuclease. The choice of endonuclease will depend, in part, on the 

particular Dane particles. For example, the HBsAg coding sequence of HBV DNA of certain Dane particles of 

the adw serotype can be isolated on a single BamH I fragment; the HBsAg coding sequence of HBV DNA of 

certain Dane particles of the ayw serotype can be isolated on a Hha l fragment. HBV DNA of Dane particles of 

the same serotype may also exhibit different patterns of restriction sites. 
65 Restriction of DNA to prepare DNA fragments used in the invention, ligation of such fragments to prepare 
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recombinant DNA molecules used in the invention and insertion into microorganisms are carried out by known 
techniques such as techniques disclosed in the previously and subsequently cited references. Conditions are 
selected to avoid denaturation of the DNA and enzymes. For example, the pH is buffered to remain at about 7.0 
to 11.0 and the temperature is kept below about 60°C. Preferably restriction is carried out at a temperature of 
about 30 to 40° C and ligation is carried out at about 0 to 10°C. Restriction enzymes and ligases used in 5 
carrying out this invention are commercially available and should be used in accordance with Instructions 
included therewith. T4 DNA ligase is the preferred iigase. 

The various fragments and final constructions may be joined together in accordance with conventional 
ways. In many cases, genes have been isolated and res friction mapped, as well as sequenced. To that extent, 
one can select the sequence of interest, such as the HBsAg sequence, by restriction of the gene, employing 10 
further manipulation as necessary such as resection with BaJ31 , in vitro mutagenesis, primer repair, or the like, 
to provide a fragment of a desired size, including the desired sequence, and having the appropriate termini! 
Linkers and adapters can be used for joining sequences, as well as replacing lost sequences, where a 
restriction site employed was internal to the region of interest. The various fragments which are isolated, may 
be purified by electrophoresis, electroeluted, ligated to other sequences, cloned, reisolated and further is 
manipulated. 

The use of regulatory regions of controlling transcription of the structural gene of interest, such as the 
HBsAg sequence, may allow for growing the host cells to high density with no or low levels of expression of the 
structural gene, such as the HBsAg sequence, and then inducing expression by changing the environmental 
conditions, e.g., nutrient, temperature, etc. ^ 

For example, with the GAL4 regulatory region, the yeast cells can be grown in rich media with a 
glyceroMactic acid combination to high density, e.g., mid or late log phase, followed by switching the carbon 
source to galactose. As another example, for PH05 regulation, one could grow the cells at a high phosphate 
concentration of about 7 mM KH2PO4, and then recover and resuspend them in medium lacking phosphate to 
effect derepression and expression. For temperature sensitivity, one could grow the cells at 25° to 37° C and 25 
then change the temperature as appropriate by about 5° to 20° C. The host cells would have the regulatory 
system associated with the regulatory region employed. 

Fusing of the HBsAg sequence to the regulatory region can be accomplished by use of intermediate 
vectors. Alternatively, the HBsAg sequence can be inserted directly into a vector which contains the regulatory 
region. A vector is DNA which can carry and maintain the DNA fragment the invention, including, for example, 30 
phages and piasmids. Techniques for cloning DNA fragments in phages are disclosed, for example by 
Charnay et al. Nature , Volume 286, 893-895 (1980) and Tioliais, United Kingdom Patent Application 2,034,323. 
Preferably, the HBsAg sequence Is positioned relative to the regulatory region such that the HBsAg 
synthesized by expression of the HBsAg sequence is devoid of extraneous amino acid residues. 

In one embodiment of this invention the HBsAg coding sequence together with 42 codons of the HBsAg 35 
precursor (preS-S) protein were fused in phase after the 18th amino acid of the coding sequence of the yeast 
OCT protein which is transcribed from the arg 3 promoter. This construct gives a hybrid particle containing 
HBsAg protein containing an additional 60 N-terminal amino acids to the 226 amino acids of the S-protein 
proper. 

An exemplary regulatory region is derived from the yeast arg 3 gene which codes for ornithine 40 
carbamoyl-transferase (OCT). The arg 3 regulatory region is subject to both specific and general control 
mechanisms. It has been cloned in E. coji on plasmid pYeura 3 arg 3 as reported by Crabeei et a!., Proc Na tl 
Acad. Sci. U.S.A., Volume 78, 6026 (1981). Preferred hosts are S. cerevislae strains In which the argtnine 
biosynthetic pathway is derepressed, such as strain IcI697d. Use of such strains results in increased 
expression from the arg 3 promoter as compared to strain DCS. Examples of other useful regulatory regions 45 
include those derived from yeast genes of the glycolytic pathway such as those coding for enolase, aldolase, 
phosphoglycerate kinase and glyceraldehyde^3-phosphate dehydrogenase; the yeast alcohol dehydrogenase 
gene; and, in general, genes involved In catabolism such as the arginase gene. 

A preferred vector for cloning the fused DNA fragment into yeast Is the plasmid YEp13, which Is capable of 
replication and maintenance in both E. coll and S. cerevisiae and Is, therefore, known as a shuttle vector 50 
Several other yeast vectors are known and available. The HBsAg and regulatory regions can be inserted Into a 
yeast vector sequentially or simultaneously. Transformation with plasmid vectors containing autonomous 
replication sequences (repiicons) functional in yeast will normally result in extra-chromosomai maintenance of 
the DNA molecule of the invention as a plasmid. Such repiicons are well known in the art. Transformation with 
plasmid vectors not containing repiicons functional In yeast can result in incorporation of the DNA molecule 55 
into chromosomal DNA. 

Vaccines for stimulating protection against HBV infection in humans comprising HBsAg produced by yeast 
in accordance with the invention and a suitable carrier can be prepared by known techniques. Use of an 
adjuvant, such as aluminum hydroxide or aluminum phosphate, is desirable. The HBsAg so produced can also 
be combined with other immunogens to prepare combination vaccines. The HBV or combination vaccines can 60 
be administered, for example, by the subcutaneous, Intravenous or intramuscular route. The DNA fragment of 
the Invention and the HBsAg produced thereby, can also be used as a probe for detection of HBV in biological 
samples by DNA hydridization techniques and various Immunoassays. 

65 



13 



0278940 

EXPRESSION OF A DNA MOLECULE CONTAINING A FUNCTIONAL DMA CODING SEQUENCE FUSED, IN 
PHASE, TO A PORTION OF THE PRE S2 REGION OF A HBV PRE S2-S PROTEIN CODING SEQUENCE 
This invention also relates to a recombinant DNA molecule comprising a functional DNA coding sequence 
fused, in phase, to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence. The term "coding 

5 sequence" or "coding region" as used herein also encompasses any functional derivative thereof. By the term 
"functional derivative" is meant a coding sequence involving amino acid alterations which, where desired, 
don't intefere with particle formation and/or which retain immunogenicity. Such functional derivatives can be 
prepared by conventional site specific mutagenesis. Site specific mutagenesis techniques are taught by 
Botstein et al., Science , 229 , 1193-1201 (1985). Such fusion may occur at the N-terminus of the Pre S2 region 

10 or at some point within the Pre S2 region. There can be four types of fusions, i.e., at the S'-terminus of the 
entire Pre S2 region, at some point within the Pre S2 region so that Pre S2 DNA is flanking both sides of the 
functional DNA coding sequence, at the 5'-terminai of a truncated portion of the Pre S2 region, or at the 
5'-terminus of the S-protein coding region so that the fusion contains no pre S2 DNA. Various Pre S2-S protein 
coding sequences are known and can be prepared or isolated by known techniques. [See, references cited 

15 herein, supra] . In one embodiment of this invention, the ARG3 regulatory region together with 18 amino acids 
of the yeast OCT protein were fused, in phase, to a truncated Pre S2-S protein coding sequence such that the 
fused sequence contained 42 codons of the Pre S2-S protein coding sequence together with the complete S 
protein coding sequence. Preferred functional DNA coding sequences include the circumsporozoite protein 
coding sequence of Plasmodium , or any immunogenic derivative thereof, the envelope gene coding sequence 

20 of HIV or any immunogenic derivative thereof, especially the C7 peptide, peptide 121 or Dreesman peptide 
coding sequence or any immunogenic derivative thereof, and the Pre S2, Pre S1 or the entire Pre S1-Pre S2-S 
protein coding sequence or any immunogenic derivative thereof. For example, a preferred functional DNA 
coding sequence is the first 13 Nterminal codons of the Pre S2 coding sequence which are then ligated to the 
C-terminal 42 codons of a truncated pre S2 region of a Pre S2-S protein coding sequence. By the term 

25 "immunogenic derivative" is meant a sequence which isa derivative or modified version of a naturally occurring 
sequence which retains or enhances the protective immunogenic activity of the natural sequence. Such 
immunogenic derivatives can be prepared by conventional techniques. The term "Pre S2, Pre S1 or the entire 
Pre S1-Pre S2-S protein coding sequence" as used herein includes any immunogenic derivative thereof. 
By the term " the circumsporozoite protein of Plasmodium * is meant the immunodominant surface antigen 

30 on the sporozoite of Plasmodium or any immunogenic derivative thereof. Useful circumsporozoite proteins 
(CS) include, but are not limited to, those derived from Plasmodium falciparum , P. vivax, P. ovale , and P. 
malariae . P. falciparum , P. vivax , P. ovale and P.malariae infect humans. 

The CS protein coding sequence of Plasmodium falciparum is disclosed by Dame et al., Science , 225 , 
593-599 (1984). The CS protein coding sequence of P. vivax is disclosed by Arnot et al., Science , 230 , 815-818 

35 (1985). The CS protein coding sequence of P. know lesi is disclosed by Sharma et al., Science , 229 , 779-782 
(1985). Enea et al., Proc. Natl. Acad. Sci. USA , 81, 7520-7524 (1984), disclose the CS protein coding sequence 
of P.cynomolgi . The CS protein coding sequences of other species of Plasmodium are known (see references 
cited above) and/or can be derived by conventional techniques. 
As used herein, the terms "CSP" or "CS protein" will include both the natural, full length circumsporozoite 

40 protein of Plasmodium as well as any Immunogenic derivative thereof. By the term "immunogenic derivative" of 
the circumsporozoite protein of Plasmodium is meant (a) any derivative of the circumsporozoite protein of 
Plasmodium which retains protective immunogenic activity or (b) any protein derived from a modified coding 
sequence of the circumsporozoite protein of Plasmodium which retains protective immunogenic activity. Such 
immunogenic derivatives can be prepared by conventional techniques. Some immunogenic synthetic 

45 derivatives of the CS protein coding sequence of P. falciparum are disclosed in Ballou et al., Science , 228 , 
996-999 (1985). See also, Mazier et al., Science , 231 , 156-159 (1986). 

The CS coding sequence for various species of Plasmodium is known and can be prepared or isolated by 
known techniques. [See, e.g., Colman et al., WO84-02922-A (P. knowlesi CS protein) and Dame et al., Science , 
225 , 593 (1984) (P. falciparum CS protein)]. A DNA molecule containing the CS coding sequence, or any other 

50 functional DNA coding sequence, fused in phase to a portion of the Pre S2 region of a HBV Pre S2-S protein 
coding sequence can be constructed by conventional techniques, such as by ligating the CS coding 
sequence, or any other functional DNA coding sequence, in phase to any codon within the 55 amino acid Pre 
S2 coding region of the Pre S2-S protein coding sequence. 
Preferably, enough of the Pre S2 coding sequence remains after ligation of the CS coding sequence, or any 

55 other functional DNA coding sequence, to insure optimal presentation of the CS protein, or any other 
functional DNA coding sequence protein, on the HBsAg particle surface, i.e., enough Pre S2 region should 
remain so that the Pre S2 region encoded polypeptide acts as a bridge between the functional DNA protein 
coding sequence product and the HBsAg particle surface. It has been reported that the 26 amlno-terminal 
amino acids of the Pre S2 coding sequence represent a dominant antibody binding site on the Pre S2 region. 

60 [See, e.g., Milich et al., Science , 228 , 1 195-1199 (1985)]. Thus, if it is desired to prepare a hybrid particle which 
contains and/or presents both Pre S2 epitopes and the CS, or any other functional coding sequence, epitopes, 
it is preferable to insert the CS coding sequence, or any other functional DNA coding sequence, at a point 
within the Pre S2 coding region which enables preparation of a hybrid particle which contains and/or presents 
both Pre S2 epitopes and CS, or any other functional DNA coding sequence, epitopes. 

65 The recombinant vector of this invention comprises the recombinant DNA molecule of the subject invention 
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(i.e., a functional DNA coding sequence fused, In phase, to a portion of the Pre S2 region of a HBV Pre S2-S 
protein coding sequence) operatively linked to a regulatory region. Such vector may additionally also comprise 
a replicon functional in yeast. By the term "replicon" Is meant that minimum region of DNA which functions to 
maintain stably, and extrachromosomally, such recombinant vector in a host yeast organism transformed with 
such vector. Such repiicons are well known in the art. By the term "regulatory region" is meant any DNA region 5 
which regulates transcription and translation of the coding sequence desired to be expressed. Such 
regulatory regions are well known in the art. Such a vector can be prepared by conventional techniques such 
as by inserting the DNA molecule of this invention In phase into a vector already containing a replicon and 
regulatory region in such a way that the DNA molecule is operatrvely linked to such regulatory region 
Preferably the DNA molecule is ligated to a vector capable of expression In the yeast genus Saccharorn yces 10 

This invention also relates to a yeast host cell transformed with such recombinant vector and to a method of 
preparing such transformed host which comprises transforming a yeast host cell with the recombinant vector 
of this invention. Such transformation is carried out by conventional techniques such as those already 
described herein. Preferred yeast cells include those belonging to the genus Saccharornyces , particularly 
those belonging to the species Saccharornyces cerevisiae and Saccharornyces carlsbergensls . 75 

This invention also relates to a method of preparing a hybrid particle containing HBsAg protein which 
contains and/or presents the functional DNA coding sequence product wherein such method comprises 
culturing the transformed yeast host ceils of this invention in appropriate culture media, and isolating the 
particle from a cell lysate or extract of such host's culture. By the term 'presented' or 'contains and/or 
presents" is meant that the functional DNA coding sequence protein is contained within the hybrid particle 20 
containing HBsAg protein in such a way as to enable an immune response to the functional DNA coding 
sequence protein to be produced. When desirable, although not essen tial, the presentation of the functional 
DNA coding sequence protein does not interfere wfth the Immunogenic activity of the HBsAg epitopes, thus 
yielding a bivalent vaccine. Most preferably the functional DNA coding sequence protein Is presented on the 
HBsAg particle in such a way that the maximum number of either the functional DNA coding sequence 25 
protein's epitopes or both the functional DNA coding sequence protein and HBsAg epttopes are properly 
exposed. By "appropriate culture media" is meant that media which will enable the transformed host to survive 
and which will also enable such host to express the hybrid functional DNA coding sequence-Pre S2-S protein 
coding sequence protein product of the vector it is transformed with in recoverable quantity. It will be 
appreciated by one of skill in the art that the appropriate culture media to use will depend upon the host cell 30 
employed. The isolation of the hybrid particle of this invention from a cell lysate or extract of such host's 
culture is carried out by conventional protein Isolation techniques. 

This invention also relates to a vaccine containing the hybrid particles of this invention. Such vaccine will 
contain an immunoprotective quantity of the hybrid particles of this Invention and is prepared by conventional 
techniques. By "immunoprotective" is meant that enough of the hybrid particles of this invention are 35 
administered to elicit sufficient protective antibody response against the agent desired to be protected against 
without serious side effects. Preferably, the vaccine of this invention comprises the micelle of this invention, 
i.e., a micelle comprising the particle of this Invention and polysorbate. The preferred amount of polysorbate 
comprised by such micelle is 5-50 \ig polysorbate per 100 pg protein. The micelle may be prepared by the 
method disclosed in European Patent Application Publication No. 0 199 698. 40 

In the vaccine of the invention, an aqueous solution of the hybrid particles of the Invention, preferably 
buffered at physiological pH, can be used directly. Alternatively, the particle, can be admixed or adsorbed with 
any of the various known adjuvants. Such adjuvants include, among others, aluminum hydroxide. 

Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et 
al., University Park Press, Baltimore, Maryland, U.S.A., 1978. 45 

The amount of the hybrid particle of the present Invention present in each vaccine dose Is selected as an 
amount which induces an immunoprotective response without significant, adverse side effects in typical 
vaccines. Such amount will vary depending upon which specific immunogen is employed and whether or not 
the vaccine is adjuvanted. Generally, it is expected that each dose will comprise 1-1000 jig of protein 
preferably 1-200 ng. An optimal amount for a particular vaccine can be ascertained by standard studies so 
involving observation of antibody titres and other responses in subjects. Following an Initial vaccination 
subjects will preferably receive a boost In about 4 weeks, followed by repeated boosts every six months for as 
long as a risk of infection exists. 

The immune response to the hybrid particle of this invention may be enhanced by the use of adjuvant, such 
as, but not limited to, aluminum hydroxide (alum). ' 

To exemplify the hybrid particle of this Invention, a 64 codon DNA sequence coding for the repetitive epitope 
of the clrcumsporozoite protein (CS) of Plasmodium falciparum and an eight codon CS coding sequence [see 
Dame et al., Science , 225, 593-599 (1984)] were each inserted in phase to a portion of the Pre S2 coding 
sequence in a yeast expression vector comprising a replicon, the Pre S2-S protein coding sequence and an 
appropriate regulatory region. Both expression vectors, I.e., one containing the 64 codon Cs sequence and 60 
one containing the 8 codon CS sequence, were each introduced into yeast host cells, and extracts of such 
transformed ceils were analyzed for the presence of HBsAg epitopes and CS epitopes on the same molecule. 

The results obtained from Immunoblot analyses using a pool of five monoclonal antibodies specific for CS 
and a monoclonal antibody raised against purified yeast-derived HBsAg showed that a 37,000 kilodalton (Kd) 
hybrid protein was obtained from lysates of yeast transformed with the vector containing the 64 codon CS 65 
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coding sequence fused In phase with the Pre S2-S protein coding sequence, and a 30,000 Kd hybrid protein 
was obtained from lysates of yeast transformed with the vector containing the 8 codon CS coding sequence 
fused in phase with the Pre S2-S protein coding sequence. Assembly of these proteins into HBsAg 
particle-like structures was shown by CsCI and sucrose gradient analysis, high pressure liquid 
5 chromatography (HPLC) and electron microscopy. Presentation of the CS epitopes on the exterior of such 
particles was shown by the accessibility of the Cs epitope to CS specific antibodies in an ELiSA assay. 
Presentation of HBsAg epitopes on the exterior of such particles was shown by radioimmune assay (AUSTRIA 
II, Abbott) or EUSA assay (Enzygnost-HBsAg, Behringwerke). 

10 NOVEL PRE S2 AND PRE S2-S PROTEIN CODING SEQUENCES 

The novel HBV Pre S2 and Pre S2-S protein coding sequences of this invention are from an HBV adw 
subtype which was isolated from plasmid pRn"10616. The term "coding sequence" or "coding region" as used 
herein also encompasses any functional derivative thereof. By the term "functional derivative" is meant a 
coding sequence with amino acid alterations which, where appropriate, don't interfere with particle formation 

15 and/or which retain immunogenic^ where such retention is desirable. Such functional derivatives can be 
prepared by conventional site specific mutagenesis such as by the method of Botstein et al., Science , 229 , 
1193-1201 (1985). Such Pre S2 and Pre S2-S protein coding sequences can be recovered by conventional 
recombinant DNA procedures from plasmid pRIT10616 which was deposited (within Ecoli K1 2 strain C600) in 
accordance with the Budapest Treaty at the American Type Culture Collection, Rockviile, Maryland, on June 2, 

20 1982, under accession number ATCC 38131. The Pre S2 region of such novel protein coding sequences codes 
for the following amino acid sequence: 

-55 -50 
25 Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-I^eu-Gln-Asp- 

-40 -30 
Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser- 



30 



-20 

Ser-Gly-ThE-Val-Asn-Pro-Ala-Pro-Asn-I le-Ala-Sec-His-Ile- 



-10 

35 Ser-Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn. 

Such region may also be obtained by conventional synthetic oligonucleotide synthesis. A preferred 
functional derivative of the Pre S2-S coding sequence of this invention is one wherein the alanine (GCT) amino 

40 acid at -46 is modified via site specific mutagenesis to a threonine (ACT). Such functional derivative results in 
two times greater level of expression by Saccharomyces cerevesiae transformed therewith as com pared to 
the unmodified coding sequence of this invention. 

A recombinant vector of this invention comprises the Pre S2 or Pre-S2-S protein coding sequence of the 
subject invention operatively linked to a regulatory region. Such vector may also additionally contain a replicon 

45 depending on preference and/or the host to be employed. By the term "replicon" is meant that minimum 
region of DNA which functions to maintain stably, and extrachromosomally, such recombinant vector in a host 
eukaryotic or prokaryotic organism transformed with such vector. Such replicons are well-known in the art. By 
the term "regulatory region' is meant any DNA sequence which contains a promoter region and other 
sequences necessary for transcription of a coding sequence. Such regulatory regions are well-known in the 

50 art. Such a vector can be prepared by conventional techniques such as by Inserting the Pre S2 or Pre S2-S 
protein coding sequence of this invention in phase into a vector already containing a replicon and regulatory 
region in such a way that the DNA molecule is operatively linked to such regulatory region. Preferably the DNA 
molecule is ligated to a vector capable of expression in yeast such as Saccharomyces . 
This invention also relates to a host cell transformed with such recombinant vector and to a method of 

55 preparing such transformed host. Such transformation is carried out by conventional techniques. Host cells 
can be prokaryotic or eukaryotic. Preferred host cells are eukaryotic especially yeast cells and include those 
belonging to the genus Saccharomyces , particularly those belonging to the species Saccharomycescerevislae 
and Saccharomyces carlsbergensis . 
This invention also relates to a method of preparing a protein encoded by the Pre S2 or Pre S2-S pro tein 

60 coding sequence of this invention wherein such method comprises culturing the transformed host cells of this 
Invention in appropriate culture media, and isolating the protein from the host cell culture fluid or from a cell 
lysate or extract of such host's culture depending upon the host cell's ability to secrete such protein. By 
"appropriate culture media" Is meant that media which will enable the transformed host of this invention to 
survive and which will also enable such host to express the Pre S2 or Pre S2-S protein coding sequence of this 

65 invention in recoverable quantity, it will be appreciated by one of skill in the art that the appropriate culture 
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media to use will depend upon the host cell employed. The isolation of the Pre S2-S protein of this Invention 
from a culture lysate or the culture fluid of such host is carried out by conventional protein isolation 
techniques. The protein prepared by the method of this invention may be glycosylated depending upon the 
host cell's ability to do so. If a degiycosylated protein is desired, the protein of this Invention should be 
prepared by utilizing a host cell incapable of effecting such glycosylate or by employing conventional site s 
specific mutagenesis on the protein's coding sequence prior to carrying out the process of this invention, 
such as by the method of Botstein et al., Science , 229 , 1193-1201 (1985). 

This invention also relates to a vaccine containing an immunoprotective amount of the Pre S2 or Pre S2-S 
protein of this invention. Such vaccine may be prepared by conventional techniques. 

This Invention also relates to a method of preparing a hybrid particle containing HBsAg protein which 10 
comprises protein encoded by the PreS2-S protein coding sequence of this invention wherein such method 
comprises culturing the transformed eukaryotic host cells of this invention in appropriate culture media, and 
isolating the particle from the host cell culture fluid or from a cell lysate or extract of such host's culture 
depending upon the transformed host cells ability to secrete such particles. By •appropriate culture media' Is 
meant that media which will enable the transformed host of this invention to survive and which will also enable 75 
such host to express the Pre S2-S protein coding sequence of this invention In particle form and In recoverable 
quantity. It will be appreciated by one of skill in the art that the appropriate culture media to use will depend 
upon the host cell employed. The isolation of the hybrid particle of this invention from a culture lysate or the 
culture fluid of such host Is carried out by conventional Isolation techniques. 

This invention also relates to a vaccine containing the hybrid particles of this invention. Such vaccine will 20 
contain an immunoprotective quantity of the hybrid particles of this Invention and are prepared by conventional 
techniques. Preferably, the vaccine of this invention comprises the micelle of this Invention, I.e., a micelle 
comprising the particle of this invention and polysorbate. The preferred amount of polysorbate comprised by 
such micelle is 5-50 \ig polysorbate per 100 \uq protein. The micelle may be prepared by the method disclosed 
in European Patent Application Publication No. 0 199 698. 2s 

In the vaccine of the invention, an aqueous solution of the Pre S2-S protein or hybrid particle of the 
invention, preferably buffered at physiological pH, can be used directly. Alternatively, the protein or particle 
can be admixed or adsorbed with any of the various known adjuvants. Such adjuvants Include, among others 
aluminum hydroxide. ' 

Vaccine preparation is generally described in New Trends and Developments In Vaccines, edited by Volier et 30 
al.. University Park Press, Baltimore. Maryland, U.SA, 1978. Encapsulation within liposomes is described, for 
example, by Fulierton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules Is disclosed, for 
example, by Likhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4,474,757. 

The amount of the Pre S2-S protein or hybrid particle of the present invention contained in each vaccine 
dose is selected as an amount which induces an immunoprotective response without significant, adverse side 35 
effects in typical vaccines. Such amount will vary depending upon which specific immunogen is employed and 
whether or not the vaccine is adjuvanted. Generally, it Is expected that each dose will comprise 1-1000 jig of 
protein, preferably 1-200 jig. An optimal amount for a particular vaccine can be ascertained by standard 
studies Involving observation of antibody titres and other responses in subjects. Following an initial 
vaccination, subjects will preferably receive a boost in about 4 weeks, followed by repeated boosts every six 40 
months for as long as a risk of infection exists. 

The immune response to the Pre S2-S protein or hybrid particle of this invention Is enhanced by the use of 
adjuvant such as, but not limited to, alum. 

NOVEL PRE S1 PROTEIN CODING SEQUENCE # 

The novel HBV Pre S1 protein coding sequence of this invention is from an HBV adw subtype which was 
isolated from plasmid pRIT10616. The term "coding sequence" or "coding region" as used herein also 
encompasses any functional derivative thereof. By the term "functional derivative" is meant a coding sequence 
involving amino acid alterations which, when desired, don't interfere with particle formation and/or which retain 
immunogenicity. Such functional derivatives can be prepared by conventional site specific mutagenesis such so 
as by the method of Botstein et a!., Science , 229 , 1 193-1201 (1985). Such Pre S1 protein coding sequence can 
be recovered by conventional recombinant DNA procedures from plasmid pRrT10616 which was deposited 
(within E. coli K12 strain C600) in accordance with the Budapest Treaty at the American Type Culture 
Collection, Rockville, Maryland, on June 2, 1982 under accession number ATCC 38131. The Pre S1 region 
codes for the following amino acid sequence: 55 
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ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 

Met Gly Thr Asn Leu Sec Val Pro Asn Pro Leu ♦ 

5 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 

Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

10 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

15 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 

Trp Asp Phe Asn Pro lie Lys' Asp His Trp 

20 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 

Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

25 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 

Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

30 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 

Gly He Leu Gly Trp Ser Pro Gin Ala Gin 

35 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 

Gly He Leu Thr Thr Val Ser Thr He Pro 

40 s 
CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 

Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly * 

45 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

50 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 

55 



60 

Such region may also be obtained by conventional synthetic oligonucleotide synthesis. 

A recombinant vector of this invention comprises the Pre S1 protein coding sequence of the subject 
invention operatively linked to a regulatory region and may additionally comprise a replicon depending on 
preference and/or the host to be employed. By the term "replicon 1 ' is meant that minimum region of DNA 
which functions to maintain stably, and extrachromosomally, such recombinant vector in a host eukaryotlc or 
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prokaryotic organism transformed with such vector. Such replicons are well-known in the art. By the term 
"regulatory region" is meant a DNA sequence which contains a promoter region and other sequences 
necessary for transcription of a coding sequence, Such regulatory regions are well-known in the art Such a 
vector can be prepared by conventional techniques such as by inserting the Pre S1 protein coding sequence 
of this invention in phase into a vector already containing a replicon and regulatory region in such a way that 5 
the DNA molecule is operatively linked to such regulatory region. Preferably the DNA molecule is ligated to a 
vector capable of expression In yeast such as Saccharomyces . 

This invention also relates to a host cell transformed with such recombinant vector and to a method of 
preparing such transformed host. Such transformation is carried out by conventional techniques. Host cells 
can be prokaryotic or eukaryotlc. Preferred host cells are eukaryotlc especially yeast cells and include those 10 
belonging to the genus Saccharomyces , particularly those belonging to the species Saccharomyces 
cerevislae and Saccharomyces carisbergensis . ~ — 

This invention also relates to a method of preparing protein encoded by the Pre S1 protein coding sequence 
of this invention wherein such method comprises cuituring the transformed host cells of this invention In 
appropriate culture media, and Isolating the protein from the host ceil culture fluid or from a cell iysate or 15 
extract of such host's culture depending upon the transformed host cell's ability to secrete such protein. By 
"appropriate culture media" is meant that media which will enable the transformed host of this Invention to 
survive and which will also enable such host to express the Pre S1 protein coding sequence of this invention In 
recoverable quantity. It will be appreciated by one of skill In the art that the appropriate culture media to use will 
depend upon the host cell employed. The isolation of the Pre S1 protein of this invention from a culture Iysate 20 
or the culture fluid of such host is carried out by conventional isolation techniques. The protein prepared by the 
method of this invention may be glycosylated depending upon the host cell's ability to do so. If a 
deglycosylated protein is desired, the protein of this invention should be prepared by utilizing a host cell 
incapable of effecting such glycosylate or by employing conventional site specific mutagenesis on the 
protein's coding sequence prior to carrying out the process of this invention. Site specific mutagenesis Is 25 
carried out by such as by the method of Botsteln et al., Science , 229 , 1193-1201 (1986). 

This invention also relates to a vaccine containing the Pre S1 protein of this invention. Such vaccine will 
contain an immunoprotective quantity of the Pre S1 protein of this Invention and is prepared by conventional 
techniques. 

In the vaccine of the invention, an aqueous solution of the Pre S1 protein of the invention, preferably 30 
buffered at physiological pH, can be used directly. Alternatively, the Pre S1 protein, can be admixed or 
adsorbed with any of the various known adjuvants. Such adjuvants include, among others, aluminum 
hydroxide. 

Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et 
al., University Park Press, Baltimore, Maryland, U.S.A., 1978. Encapsulation within liposomes Is described, for 35 
example, by Fulierton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules is disclosed, for 
example, by Likhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4,474,757. 

The amoung of the Pre S1 protein of the present invention present in each vaccine dose is selected as an 
amount which induces an immunoprotective response without significant, adverse side effects In typical 
vaccines. Such amount will vary depending upon which specific immunogen is employed and whether or not 40 
the vaccine is adjuvanted. Generally, it is expected that each dose will comprise 1-1000 jig of protein, 
preferably 1-200 jxg. An optimal amount for a particular vaccine can be ascertained by standard studies 
involving observation of antibody titres and other responses In subjects. Following an initial vaccination, 
subjects will preferably receive a boost in about 4 weeks, followed by repeated boosts every six months for as 
long as a risk of infection exists. The immune response to the PreS1 protein of this invention is enhanced by 45 
the use of adjuvant such as, but not limited to, alum. 

NOVEL PRES1 -PRES2-S PROTEIN CODING SEQUENCE 

This invention also relates to a HBV PreS1-PreS2-S protein coding sequence of which the PreS1-PreS2 
portion comprises the following amino acid sequence: 50 



55 



60 
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PRE-S1 REGION 
-163 . 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 

5 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 
GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 

10 

Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

« GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

20 TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro lie Lys Asp His Trp 

25 CCA GCA GCC AAC CAG GTA GGA* GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

30 TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

35 GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly lie Leu Gly Trp Ser Pro Gin Ala Gin 

40 GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT * 
Gly He Leu Thr Thr Val Ser Thr He Pro 

45 



50 



55 



60 
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CCT CCT GCC TCC ACC 

Pro Pro Ala Set Thr 

AGG CAG CCT ACT CCC 
Arg Gin Pro Thr Pro 

AGA GAC AGT CAT CCT 
Arg Asp Ser His Pro 
PRE-S2 REGION 

-55 

ATG CAG TGG AAT TCC 
Met Gin Trp Asn Ser 

GCT CTG CAG GAT CCC 
Ala Leu Gin Asp Pro 

TAT TTT CCT GCT GGT 
Tyr Phe Pro Ala Gly 

ACA GTA AAC CCT GCT 
Thr val Asn Pro Ala 

CAC ATA TCG TCA AGC 
His He Ser Set Ser 

GAC CCT GTG ACG AAC 
Asp Pro Val Thr Asn 
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AAT CGG CAG TCA GGA 

Asn Arg Gin Ser Gly 

ATC TCT CCA CCT CTA 
He Ser Pro Pro Leu 

CAG GCC 
Gin Ala 



ACT GCC TTC CAC CAA 
Thr Ala Phe His Gin 

AGA GTC AGG GGT CTG 
Arg Val Arg Gly Leu 

GGC TCC AGT TCA GGA 
Gly Ser Ser Ser Gly 

CCG AAT ATT GCC TCT 
Pro Asn He Ala Sec 

TCC GCG' AGG ACT GGG 
Ser Ala Arg Thr Gly 
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Culture Collection, Rockville, Maryland, under accession number ATCC 38131. The term "coding sequence 1 or 
coding region as used herein also encompasses any functional derivative thereof. By the term "functional 
■II J! , is , mean ? a codin 9 sequence with amino acid alterations which, where appropriate, don't Interfere 

with particle formation and/or which retain immunogenic^ where such retention is desirable. Such functional 
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region of DNA which functions to maintain stably, the extrachromosomally, such recombinant vector in a host 
eukaryotlc or prokaryotic organism transformed with such vector. Such replicons are well-known in the art Bv 
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sequences necessary for transcription of a coding sequence which regulates transcription of a structural 
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gene's coding sequence. Such regulatory regions are well-known in the art. Such a vector can be prepared by 
conventional techniques such as by inserting the Pre S1- Pre S2-S-protein coding sequence of this invention 
in phase into a vector already containing a repiicon and regulatory region in such a way that the DNA molecule © 
is operatively linked to such regulatory region. Preferably the DNA molecule is ligated to a vector capable of 

5 expression in yeast such as Saccharomyces . 

This invention also relates to a host cell transformed with such recombinant vector and to a method of 
preparing such transformed host. Such transformation is carried out by conventional techniques. Host cells 
can be prokaryotic or eukaryotic. Preferred host cells are eukaryotic expecially yeast cells and include those 
belonging to the genus Saccharomyces , particularly those belonging to the species Saccharomyces 

10 cerevisiae and Saccharomyces carlsbergensis . 

This invention also relates to a method of preparing a protein encoded by the Pre S1-Pre S2-S protein 
coding sequence of this invention wherein such method comprises culturing the transformed host cells of this 
invention in appropriate culture media, and isolating the protein from the host cell culture fluid or from a cell 
lysate or extract of such host's culture depending upon the host cell's ability to secrete such protein. By 

15 "appropriate culture media" is meant that media which will enable the transformed host of this invention to 
survive and which will also enable such host to express the Pre-S1 Pre S2-S protein coding sequence product 
of this invention in recoverable quantity. It will be appreciated by one of skill in the art that the appropriate 
culture media to use will depend upon the host cell employed. The isolation of the Pre S1-Pre S2-S protein of 
this invention from a culture lysate or the culture fluid of such host is carried out by conventional protein 

20 isolation techniques. The protein prepared by the method of this invention may be glycosylated depending » 
upon the host cell's ability to do so. If a deglycosyfated protein is desired, the protein of this invention should 
be prepared by utilizing a host cell incapable of effecting such glycosylation or by employing conventional site 
specific mutagenesis on the protein's coding sequence prior to carrying out the process of this invention, 
such as by the method of Botstein et al., Science , 229 , 1193-1201 (1986). The Pre S1-Pre S2-S coding * 

25 sequence of this invention will be expressed as a glycosylated protein by a yeast or mammalian host cell 
transformed therewith. The Pre S1-Pre S2-S coding sequence of this invention will be expressed as a 
N-myristylated protein by a yeast host cell transformed therewith. 

This invention also relates to a vaccine containing an immunoprotective amount of the Pre S1- Pre S2-S 
protein of this invention. Such vaccine may be prepared by conventional techniques. 

30 This invention also relates to a. method of preparing a hybrid particle containing HBsAg protein which 
comprises protein encoded by the Pre S1-PreS2-S protein coding sequence of this invention wherein such 
method comprises culturing the transformed eukaryotic host cells of this invention in appropriate culture 
media, and isolating the particle from the host cell culture fluid or from a cell lysate or extract of such host's 
culture depending upon the transformed host cells ability to secrete such protein. By "appropriate culture 

35 media" is meant that media which will enable the transformed host of this invention to survive and which will 
also enable such host to express the pre S1-Pre S2-S protein coding sequence of this invention in particle 
form and in recoverable quantity. It will be appreciated by one of skill in the art that the appropriate culture 
media to use will depend upon the host cell employed. The isolation of the particle of this invention from a 
culture lysate or the culture fluid of such host is carried out by conventional isolation techniques. 

40 This invention also relates to a vaccine containing the hybrid particles of this invention. Such vaccine will ? 
contain an immunoprotective quantity of the hybrid particles of this invention and are prepared by conventional 
techniques. Preferably, the vaccine of this invention comprises the micelle of this invention, i.e., a micelle 
comprising the particle of this invention and polysorbate. The preferred amount of polysorbate comprised by 
such micelle is 5-50 jig polysorbate per 100 \ig protein. The micelle may be prepared by the method disclosed * 

45 in European Patent Application Publication No. 0 199 698. 

in the vaccine of the invention, an aqueous solution of the Pre S1-Pre S2-S protein or the hybrid particle of 
the invention, preferably buffered at physiological pH, can be used directly. Alternatively, the protein or particle 
can be admixed or adsorbed with any of the various known adjuvants. Such adjuvants Include, among others, 
aluminum hydroxide. 

50 Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et 
al., University Park Press, Baltimore, Maryland, U.S.A., 1978. Encapsulation within liposomes is described, for 
example, by Fullerton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules is disclosed, for 
example, by Ukhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4,474,757. 
The amount of the Pre S1-Pre S2-S protein or hybrid particle of the present invention present in each 

55 vaccine dose is selected as an amount which induces an immunoprotective response without significant, 
adverse side effects in typical vaccines. Such amount will vary depending upon which specific immunogen is 
employed and whether or not the vaccine is adjuvanted. Generally, it is expected that each dose will comprise 
1-1000 jig of protein, preferably 1-200 p.g. An optimal amount for a particular vaccine can be ascertained by 
standard studies involving observation of antibody titres and other responses in subjects. Following an Initial 

60 vaccination, subjects will preferably receive a boost in about 4 weeks, followed by repeated boosts every six 
months for as long as a risk of infection exists. 

The immune response to the Pre S1-Pre S2-S protein or hybrid particle of this invention is enhanced by the 
use of adjuvant such as, but not limited to, alum. 

65 9 
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EXPRESSION OF PRES1-PRES2-S PROTEIN IN YEAST 

This invention also relates to a recombinant DNA vector comprising an entire PreS1-PreS2-S protein coding 
sequence operatlvely linked to an expression control sequence. The term 'coding sequence" or 'coding 
region" as used herein also encompasses any functional derivative thereof. By the term 'functional derivative" 
is meant a coding sequence with amino acid alterations which, where appropriate, don't Interfere with particle 
formation and/or which retain immunogenicrty where such retention is desirable. Such functional derivatives 
can be prepared by conventional site specific mutagenesis such as by the method of Botstein et al Science 
229 1193-1201 (1985). Such vector may additionally comprise a replicon depending upon preferen ce and/or 
the host to be employed. Preferably, the PreS1-PreS2-S protein coding sequence employed Is the one of this 
■ invention. 

By the term "replicon" is meant that minimum region of DNA which functions to maintain stably and 
extrachromosomally, such recombinant vector In a host yeast organism transformed with the vector of this 
invention. Such replicons are well-known In the art. By the term 'regulatory region' is meant any DNA 
sequence which contains a promoter region and other sequences necessary for transcription of a coding 
sequence which regulates transcription of a structural gene's coding sequence. Such regulatory regions are 
well-known in the art. Such a vector can be prepared by conventional techniques such as by Inserting a PreS1- 
PreS2-S protein coding sequence in phase Into a vector already containing a replicon and regulatory region In 
such a way that the DNA molecule is operatlvely linked to such regulatory region. Preferably the DNA molecule 
Is ligated to a vector capable of expression In the yeast genus Saccharomyces . Preferably the Pre S1- Pre S2-S 
protein coding sequence contained by the vector of this invention is the Pre S1- Pre S2-S protein coding 20 
sequence of this invention described hereinabove. Such vectors are also useful for the Insertion therein of 
functional DNA coding sequences to enable expression of the resulting Pre S1 -functional DNA coding 
sequence-Pre S2-S protein coding sequence comprised by such vector. Therefore, this Invention also relates 
to a recombinant DNA molecule comprising a functional DNA coding sequence fused In phase Into the Pre S1 
region of the Pre S1-Pre S2-S protein coding sequence. In such a way that the functional DNA sequence either 
forms an insertion in the Pre S1 region or replaces part of the Pre S1 region and possibly a portion of the Pre 
S2 region, and to a vector comprising such recombinant DNA molecule. See. for example. Example 21 A infra. 
By the term 'functional DNA coding sequence" is meanta DNA coding sequence which, when fused. In phaie" 
nto the Pre S1 region of the Pre S1-Pre S2-S protein coding sequence, is not involved with, and does not 
Interfere with, the assembly of the HBsAg particle. Preferred functional DNA coding sequences include but are 
not limited to. the circumsporozoite coding sequence of Plasmodium or any immunogenic derivative thereof 
the envelope gene coding sequence of HIV or any Immunogenic derivative thereof, especially the C7 peptide' 
peptide 121 or Dreesman peptide coding sequence as hereinafter exemplified, or sequences coding for 
peptides of interest from other viruses expeclally those for coat proteins or for surface proteins from other 

This invention also relates to a yeast host cell transformed with such recombinant vector and to a method of * 
prepanng such transformed host. Such transformation Is carried out by conventional techniques. Preferred 
yeas ceHs and include those belonging to the genus Saccharomyces . particularly those belonging to the 
species Saccharomyces cerevislae . 

This Invention also relates to a method of preparing a protein encoded by a Pre S1-Pre S2-S protein codlna 40 
sequence or Pre S1-functional DNA coding sequence-Pre S2-S protein coding sequence wherein such 
method comprises culturing the transformed host cells of this invention in appropriate culture media, and 
Isolating the protein from the host cell culture fluid or from a cell lysate or extract of such host's culture 

SSh^?^ 0 !! 0 ! 11 ' 8 !! 511 ^ such P rote| n- By 'appropriate culture media' Is meant that 

media which will enable the transformed host of this invention to survive and which will also enable such host 45 
to express a Pre S1-Pre S2-S protein coding sequence or Pre S1-functional DNA coding sequence - Pre S2-S 
protein coding sequence product in recoverable quantity. It will be appreciated by one of skill in the art that the 
appropriate culture media to use will depend upon the host cell employed. The isolation of the Pre S1-Pre S2-S 
protein or Pre S1 -functional DNA coding sequence - Pre S2-S protein from a cell lysate or the culture fluid of 
!2LS"V S £ Ur ! ' S C T 1ed ° Ut by conventlonal P~teln isolation techniques. The protein prepared by the so 
method of this Invention may be glycosylated depending upon the host cell's ability to do so. If a 
deglycosylated protein is desired, the protein of this invention should be prepared by utilizing a host cell 
incapable of effecting such glycosylate or by employing conventional site specific mutagenesis on the 
protein s coding sequence prior to carrying out the process of this invention, such as by the method of 
Botstein et al., Science . 229. 1193-1201 (1985). 

BP D, iS iT, n,iC ." alS ?«!!! tes t0 a vaccine contain 'ng an immunoprotectlve amount of Pre-S1-Pre S2-S protein 
n l DNA ° odin 9 sequence - Pre S2-S protein produced by the method of this invention. Such 
vaccine may be prepared by conventional techniques. 

PrIS? JSc" ™ late !, t0 a method * Preparing a hybrid particle which comprises protein encoded by a 
™!1 P t P° ding sec ' lJence or Pre S1-functlonal DNA coding sequence-Pre S2-S protein 

coding sequence wherein such method comprises culturing the transformed yeast host cells of this invention 
Lvt^° P f riate h c " ltu '; e media - a" 0 " ,sola tmg the particle from the host ceil culture fluid or from a cell lysate or 
* 8 0, i tU . r f de P endlna u P° n the transformed host cells ability to secrete such particles. By 

JKS^hi t k f ' 8 meant th8t media **** wl " enable the transformed host to survive and which 
will also enable such host to express the Pre S1-Pre S2-S protein coding sequence or Pre S1 -functional DNA 
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coding sequence - Pre S2-S protein coding sequence in particle form and in recoverable quantity. It will be be 
appreciated by one of skill In the art that the appropriate culture media to use will depend upon the host cell 
employed. The isolation of the particles of this invention from a culture lysate or the culture fluid of such host is 
carried out by conventional isolation techniques. 

5 This invention also relates to a vaccine containing the particles of this invention. Such vaccine will contain an 
Immunoprotective quantity of the particles of this invention and are prepared by conventional techniques. 
Preferably, the vaccine of this invention comprises the micelle of this invention, i.e., a micelle comprising the 
particle of this invention and polysorbate. The preferred amount of polysorbate comprised by such micelle is 
5-50 p.g polysorbate per 100 p.g protein. The micelle may be prepared by the method disclosed in European 

10 Patent Application Publication No. 0 199 698. 

In the vaccine of the invention, an aqueous solution of the Pre S1-Pre S2-S protein or Pre S1 functional DNA 
coding sequence - Pre S2-S protein or the particle of the invention, preferably buffered at physiological pH, 
can be used directly. Alternatively, the protein particle can be admixed or adsorbed with any of the various 
known adjuvants. Such adjuvants include, among others, aluminum hydroxide. 

15 Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et 
al., University Park Press, Baltimore, Maryland, USA, 1978. Encapsulation within liposomes is described, for 
example, by Fuilerton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules Is disclosed, for 
example, by Likhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4,474,757. 
The amount of the Pre S1 -Pre S2-S protein or Pre S1-functional DNA coding sequence - Pre S2-S protein or 

20 particle of the present invention present in each vaccine dose is selected as an amount which induces an 
immunoprotective response without significant, adverse side effects in typical vaccines. Such amount will vary 
depending upon which specific immunogen is employed and whether or not the vaccine is adjuvanted. 
Generally, it is expected that each dose will comprise 1-1000 \ig of protein, preferably 1-200 jig. An optimal 
amount for a particular vaccine can be ascertained by standard studies involving observation of antibody titres 

25 and other responses in subjects. Following an initial vaccination, subjects will preferably receive a boost in 
about 4 weeks, followed by repeated boosts every six months for as long as a risk of infection exists. 

The immune response to the Pre S1-Pre S2-S protein or Pre S1-functional DNA coding sequence - Pre S2-S 
protein or particle of this invention is enhanced by the use of adjuvant such as, but not limited to, alum. 

30 EXAMPLES 

In the following examples of the invention, which are illustrative and not limiting 

- All percentages are by weight and all temperatures are in degrees Celsius (Centigrade). 

* The enzymes used in DNA manipulations were obtained from Bethesda Research Laboratories, New England 
Biolabs, and/or Boehringer, and were employed according to the suppliers' directions. 
35 - Yeast growth media : selective medium : YNB (yeast nitrogen base without amino acids (Difco Labs) 0.675 % 
(w/v) containing 2% (w/v) glucose). 

- YEPD medium : 1 % (w/v) yeast extract, 2 % w/v peptone and 2 <Vb (w/v) glucose. 

- DNA recombinant methods are described in "Molecular Cloning", T. Maniatis et al., Cold Spring Harbor Lab. 
(1982). 

40 - PMSF : phenyl methyl sulphonyl fluoride 

- The following abbreviations may be employed: 

PBS : Phosphate Buffered Saline (per liter): (pH 7.4) 
8.0 gram NaCi 
45 0.2 gram KCI 

1.15 gram Na2HP04 
0.2 gram KH2PO4 
0.1 gram CaCfe 
0.1 gram MgCl2 • 6H2O 

50 

BSA: Bovine Serum Albumin (A4378, Sigma) 

X-gal : 5-bromo-4-chIoroindoyl-P-D-galactopyranoside, commercially available from Sigma Chemical Co., St. 
Louis, Missouri. 

55 L-BROTH (per liter): 
10 gram Tryptone 
5 gram NaCI 
5 gram Yeast extract 
1 ml 0.1M MgS04 

60 Add 5 ml of aseptic solution of thiamine (1 mg/ml) after autoclaving. For solid media, add 15 g agar per liter. 



Example 1 . Construction of plasmid pRiT10167. 

The starting material was plasmid p6y containing a 2.1 kb Hind lll fragment of yeast DNA encoding the TDH3 
gene cloned on pBR322. pBR322 is described by Bolivar et al. (Gene, 2, 95-1 13, 1977). The p6y plasmid was * 
65 constructed and characterized by Musti et al. (Gene, 25, 133, 1983) andfeceived from Dr. M. Rosenberg of the 
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National Institute of Health. The Hindlll fragment was recloned onto a pBR322 derivative In which the EcoRI site 
had been destroyed to give plasmid pRrT10164 (a non essential manouver). The following manipuiatTonswere 
carried out so as to create a Bam HI site located at the Q residue of the ATG codon of the TDH3 coding 
sequence and to obtain a Hindlll-BamHI DNA fragment with TDH3 promoter activity In which the TDH3 
sequences are intact and unchanged by the manipulation. DNA of pRITIOiW, prepared as described above, 
was purified by two cycles of CsCI - ethidlum bromide density gradient centrtfugation essentially as described 
by Kahn et al. (Methods in Enzymology, 68, 268, 1979). 150 fig of pRIT10164 DNA were digested to completion 
with 75 units of Xbal endonuclease, extracted with phenol and ether, precipitated with ethanol and the DNA 
resuspended In 0.01 M tromethamlne-HCI buffer at a concentration of 1 fig per pi. Samples of 20 tig of this Xbal 
treated DNA were digested with BaI31 nuclease to resect the 61 base pairs of DNA between the ATG codon 
and the Xbal site. Dlgestons with Bal31 were carried out at 30° for 1 to 3 minutes In buffer containing 600 mM 
NaCI, 12 mM CaCI2, 12 mM MgCI2, 1 mM EDTA, 20 mM tromethamlne-HCI, pH3.1 using one unit of B531 
nuclease per 20 \ig DNA in a final reaction volume of 200 pj. — 

The enzyme reactions were stopped by addition of ethylenebls (oxyethylenemtrilo) tetraacetic acid (EGTA) 
to give 20 mM final concentration and the samples were extracted with equal volumes of phenol, ether and 
precipitated with ethanol. Each DNA sample was resuspended In 20 jil 10 mM tromethamine-HCI buffer pH 7.5. 
The extent of Bal31 digestion was measured by digesting about 2.5 \ig of each DNA sample with Hpal 
endonuclease and comparing the size of the Hpal-Xbal fragments to the 335 bp Hpal-Xbal fragment from 
PRIT10164. A 2 minute digestion with Ba}31 was found to remove about 41 to 88 nucleotides from the 
Xbal-Hpal fragment of pRIT10164. 

This result indicates that a similar number of base pairs had been removed from the other Xbal extremity 
towards the ATG codon. 5 ng of the DNA recovered from the 2 minute Ba|31 digest was dlgestedwith BamHI 
endonuclease, extracted with phenol and recovered by ethanol precipitation. This DNA was treated with 5 
units of T4 polymerase in the presence of deoxynucleotide triphosphates to fill In and render flush ended the 
BamHI and any BaJ31 single stranded extremities, extracted with phenol and recovered by ethanol 
precipitation. 2.3 \xg of this DNA was treated with 5 units of T4 DNA ligase and half of the ligation mixture used 
to transform competent ceils of E. coli K12 strain MM294 prepared according to the method of Cohen et al. 
(Proc. Natl. Acad. ScI. 69. 2110, 1972). One ml of the transformed E. coli population was diluted Into 350 ml 
L-broth with 200 jig/ml ampicillin and total plasmid DNA was purified from the resulting culture. 

80 \iq of this plasmid DNA containing a population of plasmid molecules with BaI31 Induced deletions of 
about 40 to 80 base pairs was digested sequentially wigh 75 units of Hind lll endonuclease and 96 units of 
BamHI endonuclease to release DNA fragments from those plasmids in which a Bam HI site had been 
recreated after the treatments described above, that is, where the Ba|31 digestion had terminated at a G 
residue. This DNA digest was run on a preparative Wo agarose gel and the desired HlndlH-BamHI fragments 
corresponding to sizes from 1 100 to 1000 bp excised from the gel in two slices, one corresponding to DNA of 
about 1070 bp and the other of about 1030 bp. The DNA was recovered from the two agarose gei slices by 
cycles of freezing and thawing followed by high speed centrlfugatlon to pellet the agarose. The supernatant 
liquid was filtered through a Mliiipore GV Mlllex filter and the DNA recovered by two cycles of ethanol 
precipitation and resuspended in 20 jii of 0.01 M tromethamine-HCI buffer pH 7.5. 

Analysis by agarose gel electrophoresis and comparison to a Hind lll plus Xbal digest of pRIT10164 DNA and 
to the fragments from a Hindlll plus EcoRI digest of phage X DNA showed that two distinct populations of 
Hind lll -Bam HI fragments were obtained, one with an estimated size of about 1070 bp and one with an 
estimated size of about 1030 bp compared to the 1120 bp parental Hlndlll-Xbal fragment from pRIT10164. 
About 1 00 ng of the 1 030 bp Hindlll-BamHI fragment population was llgated with 200 ng of plasmid pUC9 which 
had been digested with Hindlll and Bam HI endonucleases and treated with alkaline phosphatase (See, Shine, 
U.S. Patent No. 4,264,731) and the ligation mixture was used to transform competent cells of E. coli strain* 
JM103 with selection being made for ampicillin resistance. Transformation of E. coli strain JM103 waseffected 
according to the method of Cohen et al., cited above. 

Both the pUC9 vector plasmid and the recipient E. coll strain JM103 have been described by Vlelra and 
Messing In Gene, 19, 259, 1982 and were obtained from J. Messing (Univ. of Minnesota). The pUC9 vector Is 
commercially available from Amersham (Buckinghamshire, United Kingdom) and Pharmacia (Uppsala, 
Sweden). E. coji strain JM103 is available from J. Messing (Univ. of Minnesota). Another E. coil strain with the 
characteristics of E. coll strain JM103, I.e., JM101 , can be obtained from the American Type Culture Collection, 
Rockville, Maryland, under accession number ATCC 33876, and can also be employed as host. About 400 
ampicillin resistant colonies per ml were obtained and of 98 colonies tested on medium containing X-gai. 95 
were white indicating the successful insertion of a foreign DNA fragment between the Hindlll and BamHI sites 
on the vector. 

Plasmids from individual transformant colonies were prepared by a small scale procedure (Birnbolm and 
Doty, Nucl. Acid Res., 7, 1513, 1979) after amplification of the plasmids by addition of spectinomycin (150 
ng/ml) to the growing cultures. 

The recombinant plasmids were analyzed on 7.5<Vo acrylamide gels after double digestion with Avail and 
BamHI endonucleases and compared to the 450 bp AyallXbal fragment comprising the promoter-N^termlnus 
coding region of pRIT10164 and to the fragments of a Hpall digest of pBR322 DNA. An Avall-BamHI fragment 
corresponding to deletions of from 20 to 90 base pairs was present In 35 of 36 plasmids examined when 
compared to pRIT10164. Three plasmids representing deletions of about 80 base pairs (pRIT10166) 50 base 
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pairs (pRIT10167-Figure 8) and 45 base pairs (pRIT10165) were chosen for further study. Plasmid DNA was 

prepared by CsCI-ethidium bromide density centrifugation and 25 \ig of each digested with EcoRI. The EcoR I 

extremities were labelled with y- 32p -ATP by exchange kination and the nucleotide sequence of the s 

Hind lll -EcoR I fragments from each plasmid determined by the chemical modification methods of Maxam and 

5 Gilbert (Methods in Enzymology, 65, 499, 1980). Labelling of and sequencing from the EcoR I site in the pUC9 
vector portion of each recombinant plasmid is a convenient means of determining the sequence around the 
adjacent BamH I site marking the end of the deletion into the TDH3 DNA frag- ment. This sequencing analysis 
showed that in plasmid pRfri0166, the BamHI site is recreated at a G residue located in the 5' non translated 
region 25 base pairs upstream of the ATG codon ; in pRIT1 01 65 the BamH I site is located at the second base of 

10 the third codon; and In pRIT10167 the BamH I site is located at the G residue of the ATG codon. 

The Hindlll-BamHI fragment of TDH3 DNA thus constructed in pRIT10167 contains all the sequences 
necessary for TDH3 promoter activity in an unaltered form. 

Example 2. Construction of vector pRIT10172. 

15 The 1050 bp Hlndlll -Bam HI TDH3 DNA insert from pRIT10167 ( prepared as described in Example 1 ( was 
recioned onto the YEp1 3 shuttle vector described by Broach etal.(Gene f 8. 121, 1979) between the Hindlll site 
In the 2 micron DNA portion of the vector and the BamH I site to give the recombinant plasmid pRIT1 21 59. DNA 
of pRIT12159 was then digested with Xbal and BamH I endonucleases and the resulting 1650 bp fragment was 
ligated in place of a similar Xba l -BamH l fragment containing the arg 3 promoter on plasmid pRIT10774. Plasmid 

20 pRIT10774 is described by Cabezon et al. (Proc. Natl. Acad. Sci. U.S., 81, 6594-6598, 1986), and comprises the ? 
YEp13 repiicon from which the natural vector's BamH I and Xhol sites have been deleted by in vitro 
manipulation together with a 1470 bp Hind lll -BamH I insert comprising the arg 3 promoter region and a 1150 bp 
BamJHI-Hindlll fragment comprising the arg 3 transcription termination region. Replacement of the arg 3 
promoter insert on pRIT10774 by the TDH3 promoter insert gives plasmid pRIT10172 which therefore contains f 

25 a single BamH I site located at the ATG codon of the TDH3 promoter insert and preceeding a functional signal 
for transcription termination on the 1150 bp BamH l -Hind lll arg 3 DNA fragment. Foreign DNA can therefore be 
cloned at this site. Furthermore the two DNA inserts are bounded by Hindlll sites enabling removal of the 
expression unit module (cassette) as a 2200 bp Hind lll fragment for insertion on other vectors. Figure 9 is a 
restriction endonuclease cleavage map of pRIT10172. 

30 

Example 3 . Construction of pRIT12290. 

A 1050 bp Hindlll -EcoR I fragment from pRIT10167 (Figure 8), prepared as described in Example 1, and 
containing the Hindlll -BamHI TDH3 promoter fragment together with the BamH l -Sma l -EcoR I part of the pUC9 
polylinker was recioned between the Hind lll and EcoR I sites of a pBR322 derivative plasmid previously deleted 
35 fro the BamH I site by filling in with T4 DNA polymerase and reiigation. The resulting recombinant plasmid was 
identified as pRfT12176. 

Plasmid DNA of pRIT 10158 (Figure 8), prepared as described in Example 4 (b), below, was digested with 
EcoR I endonuclease and treated with T4 DNA polymerase to fill in the EcoR I single strand extensions. This 
preparation was further digested with Clal endonuclease. A further sample of pRIT10158 DNA was digested 

40 with Clal and Haeill endonucleases and a 1150 bp ClalHaelil fragment carrying the transcription termination • 
region of the arg 3 gene was recovered by preparative agarose gel electrophoresis and electroelution. This 
fragment wasligated with the EcoR I, T4 DNA poly merase, Clal treated pRIT10158 DNA, and the ligation 
mixture was used to transform competent cells of E. coli strain MM294 prepared as described by Cohen et al., 
above, to ampiciliin resistance. From the transformed colonies, a plasmid, identified as PRIT10162, was f 

45 isolated in which the Clal-Haelll arg 3 transcription termination fragment was inserted on pRIT10158 between 
the Clal and filled in EcoR I sites and in which the EcoR I site has been recreated. Figure 8 is a flow sheet which 
Illustrates the preparation of pRIT10162. Plasmid DNA of pRIT10162 was digested with EcoR I and Pstl 
endonucleases, mixed with plasmid DNA of pRIT12176, prepared as described above, (also digested with 
EcoR I and Pstl endonucleases) and the mixture was ligated and used to transform cells of E. coliK12 strain 

50 MM294 to ampiciliin resistance by the methods of Cohen et al., cited above. A plasmid, pRIT12208, was 
recovered from a colony from this transformation in which the large 4630 bp EcoRI-Pstl fragment from 
pRIT12176 has been ligated to the small 1930 bp EcoR I Pst l fragment from PRIT10162, and in which the arg 3 
transcription termination region is now juxtaposed to the TDH3 promoter. The arg 3 and TDH3 DNA regions can 
be excised from pRIT12208 as a single 2200 bp fragment by digestion with Hind lll endonuclease. To obtain the 

55 other orientation of this fragment with respect to vector sequences the 2200 bp Hind lll fragment from 
pRIT12208 was recioned into the Hind lll site of a pBR322 derivative plasmid in which the natural EcoR I and 
BamH I sites had separately been deleted by filling in with T4 DNA polymerase and reiigation. The resulting 
plasmid, with the 2200 bp Hind lll fragment inserted in the opposite orientation with respect to vector 
sequences compared to pRIT12208, is identified as pR!T12290 and deposited as ATCC 67186 on August 18, 

60 1986 at the ATCC in accordance with Budapest Treaty's regulations. Figure 10 is a restriction endonuclease 
cleavage map of pRIT12290. In both the pRIT12208 and pRIT12290 vectors, the TDH3 promoter and are| 3 
transcription termination fragments are separated by unique BamH I, Sma l and EcoR I restriction sites useful 
for the cloning of DNA fragments carrying coding sequences, and the promoter and transcription termination 
regions can be excised together on a 2200 bp Hind lll fragment as a module or cassette for transfer to other # 

65 vectors. 
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The BamH I site is especially useful as it Is located at the ATG codon of the TDH3 gene and can be employed 
for fusion of other genes to the TDH3 promoter with a view to expressing them In yeast as exemplified below 
Such fusions can be recovered from pRIT12208 or pRIT12290 derivatives in the form of cassettes or modules 
for insertion on yeast shuttle vectors by digestion with Hlnd lll endonuclease or by the use of other restriction 
endonucleases cutting at flanking restriction sites. 6 

Example 4. Construction of plasmids ^1110677, pRIT10909 and pRIT10158. 

a) pRIT10677 

A 1372 bp BamH I fragment of cloned HBV DNA was excised from pRIT10616 (Harford et el, Dev. Biol w 
Standard, 54, 125-130, 1983) and mixed and ligated with BamH I endonuclease digested pBR327 DNA. This 
BamHI fragment contains part of the HBsAg precursor region, the HBsAg coding sequence and 3' non coding 
sequences. From the ligation mixture a plasmld, pRIT10677, was selected which was the HBV BamH I fragment 
inserted at the BamH I site of the vector in the orientation such that the Xba1 site in the HBV DNAiTproximal to 
the Sail site In the pBR327 vector. Figure 8 is a flow sheet which illustrates the preparation of pRIT10677. 15 

b) pRIT10909 

The 3300 bp Hindlll fragment from plasmid pMC200 (available from the American Type Culture Collection 
Rockville, Maryland, U.S.A., without restriction under accession number ATCC 39131), described by Crabeei 
M. et al (EMBO J., 2, 205-212, 1983), was recloned onto a pBR322 derivative plasmld on which thfe natural 20 
EcoRI site had been deleted by filling In with T4 DNA polymerase and religation. A resulting recombinant 
plasmid, pRIT10158 (Figure 8), was selected In which the Hind lll arg 3 gene Insert has the 3' region of the arg 3 
gene proximal to the Cla1 site on the modified pBR322 vector. From PRIT10158, a 1 150 bp ClalHaelll fragrfr^t 
was recovered which carries the transcription termination region of the arg 3 gene. This 1 150 bpfragment was 
ligated with Cla1 and Hga1 endonucleases digested DNA of plasmld pRIT10677, (prepared as described in Part 25 
a), above, to fuse the arg 3 termination region at the Hga1 site downstream of the HBsAg coding region. The 
resulting plasmid is pRIT1 0909. Figure 8 Is a flow sheet which illustrates the preparation of pRIT1 0909. Figure 3 
is a restriction endonuclease cleavage map of pMC200. 

Example 5. Construction of plasmids pRIT10911 and pRJT10912. qq 

TTie BamH I site on pRIT10167 (Example 1) and the natural BamH I site located in the precursor region of the 
HBsAg gene from a HBV virus of adw serotype cloned on pRIT10616 (Harford et al., Dev. Biol. Standard, 54, 
125-130, 1983) are in the same translatlonal reading frame enabling the construction of a fusion of the TDH3 
promoter fragment which supplies the ATG codon to 42 codons of the HBsAg precursor sequence followed by 
226 codons of the HBsAg coding sequence proper. The source of the HBV DNA fragment for this fusion was a 35 
plasmid pRIT10909 (see, Example 4, part b, above) containing a 935 bp BamH I Hpa l Insert encoding part of the 
HBsAg precursor region, the full HBsAg coding sequence and 128 bp of 3' non translated DNA to which had 
been fused, downstream of the Hpa I site, an 1150 bp Haelll-Cjaill DNA fragment from pRIT10158 containing 
the arg 3 transcription termination region. pRIT10158 Is described In Example 4, part B, above. The 2085 bp 
EcoRI-BamHI fragment was excised from pRIT10909 and inserted between the EcoRI and BamH I sites of 40 
PRIT10167, prepared as described In Example A, so as to create plasmld pRIT10911. The 3135 bp Hind lll 
fragment from pRIT10911 containing the yeast DNA transcriptional signals and the HBsAg coding sequences 
was then inserted on the YEp13 shuttle vector to give plas- mid pRIT10912. Figure 8 Is a flow sheet illustratlnq 
the preparation of pRIT10911. 
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Example 6. Construction of plasmids pRIT10903 and pRIT10914 

The aim of the constructions described below was to remove excess nucleotides from the 5' end of a DNA 
fragment encoding at least the N-terminal portion of the mature HBsAg coding sequence in such a fashion as 
to create a 6 base pair restriction site at the first base of the second codon. The resulting fragment can then be 
fused to promoter fragments with a 6 base pair restriction site at the initiation codon using Mung Bean or S1 so 
nucleases to trim away the single strand extensions and create blunt ends for ligation as shown by Rosenberg 
et al., Methods in Enzymoiogy, 101C , 123 (1983). 

A 1225 bp FnuDII fragment of HBV DNA from pRIT10616 (See, Harford et ai. (1983) Devel. Biol. Standard, 54, 
125-130) and encoding the HBsAg gene was excised from this plasmid and ligated with synthetic octomer 
EcoRI linkers, and the fragment cloned into the EcoR I site of vector pACYC184 to give pRiT10679 (Figure 8) 55 
Vector pACYC184 was received from S. Cohen and is described by Chang A.C.Y. and Cohen S. (1978), J 
Bacterid., 134, 1141- 1156. pACYC184 is also available from the American Type Culture Collection Rockville 
Maryland, U.S.A., under accession number ATCC 37033. From this EcoR I ( FnuD II) fragment,' a 125 bp 
EcoR I-Xbal fragment was obtained which contains the Cterminal region of the HBsAg precursor, the HBsAg 
ATG codon and 90 bp of HBsAg coding sequence. This 1 25 bp EcoRI-Xbal fragment was introduced between eo 
the EcoRI and Xbal sites of pRIT10158 (see, Example 4 (b), above). The resulting plasmid, pRIT10903 
therefore contains a sin- gle EcoR I site located at the junction between arg 3 DNA and HBV DNA and has a 
single Xhol site located in the arg 3 promoter region. Figure 8 Is a flow sheet which illustrates preparation of 
pRIT109G3. 

150 ^g of pRIT10903 plasmid DNA, prepared by CsCI-ethidium bromide density gradient centrtfugatlon, was 65 
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digested with 80 units of EcoR I endonuclease, extracted with phenol, ethanol precipitated and resuspended 

at 1 jxg per jjlI in 10 mM tromethamine-HCI buffer (pH 7.5). The DNA was divided into three 50 \ig samples and 

incubated for 50, 75 and 90 seconds with Bal31 nuclease at 30°. Each reaction mixture contained the same * 

reaction buffer cited in Example 1, 50 \lq of EcoR I digested pRIT10903 DNA, and 2.5 units of Bal31 nuclease in 

5 a final volume of 500 The reactions were stopped by addition of EGTA to 20 mM final concentration, chilling 
on ice, extraction with an equal volume of phenol and precipitation with etha nol. The Bal31 treated DNA 
samples were each taken up in 50 (il 10 mM tromethamine-HCI buffer and 2.5 jig digested with BstE II 
endonuclease and compared on a 7.5°/o acrylamide gel to an EcoRI plus BstE II endonuclease digest of 
pRlT10903 which releases a 285 bp fragment. Treatment with Bal31 nuclease for 75 seconds at 30° was found 

10 to reduce the size of this EcoRI BstEII fragment by 10 to 60 base pairs giving a series of discrete bands 
estimated to represent removal of 10,30,45 and 60 base pairs of DNA. Resection of 29 base pairs from the 
original FnuD II site would remove all the HBsAg precursor DNA and the HBsAg ATG codon. 

5 jig of pRIT10903 DNA which had been treated with Bal31 for 75 seconds was digested with 8 units of Xhol 
nuclease, extracted with phenol and ethanol precipitated. This DNA was then incubated with 5 units of T4 DNA 

15 polymerase in the presence of deoxynucleotide triphosphates to fill in and render flush ended the Xhol and 
Bal31 extremities, treated with phenol and ethanol precipitated. The DNA was then incubated with 5 units of T4 
DNA ligase for 16 hours at 16° and the mixture used to transform competent cells of E. coll K12 strain MM294 
to ampicillin resistance according to the method of Cohen et al., cited above. About 2000 ampiciilin resistant 
colonies per ml were recovered after plating. Plasrnids were prepared from 47 individual colonies by a small 

20 scale procedure (See, Birnboim et al, cited above), and 23 of 47 were found to retain a Xho l site indicative of * 
successful filling in of the original Xho l site and ligation to a 3' end terminating in a G residue. These 23 
plasrnids were further digested with Xhol and Xba l endonucleases to determine the size of the small restriction 
fragment in comparison with the original EcoR I -Xba l fragment from pRIT10903. 
Several plasrnids were identified with deletions estimated to range from 25 to 40 base pairs. One plasmid, * 

25 pRIT10914, which was estimated to contain a Xho l Xba l fragment of 95 to 100 base pairs was selected for 
further study. Figure 8 is a flow sheet which illustrates preparation of pRIT10914. DNA of this plasmid was 
purified by CsCiethidium bromide density gradient centrifugation and 25 of such DNA was digested with 
140 units of Xbal endonuclease. 
The Xba l terminii were exchange labelled with y- 32 P-ATP, and the labelled DNA was digested with 15 units of 

30 Hind lll endonuclease. The small 765 bp Hind lll -Xba l fragment was isolated by acrylamide gei electrophoresis 
and electroelutlon followed by ethanol precipitation. The nucleotide sequence at and around the Xho l site was 
determined by sequencing this fragment from the labelled Xbal terminus using the chemical modification 
methods of Maxam and Gilbert, cited above. The sequence data showed that the Xho l site was located at the 
first base of the second codon of the HBsAg coding sequence. 

35 

Xhol 

CTCGAGAAC 

40 HBsAg nucleotides ? 



This fragment is therefore suitable for fusion to the BamH I extension of the TDH3 promoter fragment on 
pRIT10167 (Example 1) after removal of single stranded ends. 

45 

Example 7. Construction of plasrnids pRlT12211, pRIT12209, pRIT12230, and pRIT12265 

Plasmid pRiT1091 1, prepared as described in Example 5, was used as a scaffold for further manipulations. 
To introduce desirable restriction sites, the plasmid was digested with Bam HI and Xba l endonucleases, and 
the excised fragment replaced by a 2275 BamHl-Xbal fragment from plasmid pRIT10158 (Example 4). This 

50 manipulation results in plasmid pRIT12211 in which the Hind lll -BamH I TDH3 promoter fragment is next to a 
BamH l -Xbal fragment containing a Xho l site followed by an Xba l- Hind lll fragment comprising the C-terminal 
region of the HBsAg coding sequence and 128 bp of 3' non coding DNA fused to the 1150 bp arg 3 
transcription termination region. Figure 8 is a flow sheet which illustrates preparation of pRIT12211. Plasmid 
pRlT12211 was digested with Xho l and Xba l endonucleases thereby removing a 1600 bp fragment which was 

55 replaced by the 94 bp Xho l Xba l fragment of modified HBsAg DNA from pRIT10914, prepared as described in 
Example 5. This 94 bp fragment was purified by acrylamide gel electrophoresis from a Xho l plus Xba l digest of 
PR1T10914 DNA followed by electroelution of the appropriate gel slice and recovery of the DNA by ethanol 
precipitation. 

The plasmid, pRIT12209, resulting from the introduction of the 94 bp fragment (See, Figure 8), was purified 
60 by CsCI-ethrdium bromide density centrifugation, and 50 jig of pRIT12209 DNA was digested with 90 units of 
BamH I endonuclease and 60 units of Xho l endonuclease to remove the 990 bp of extraneous DNA between 
the TDH3 promoter and the HBsAg coding region, and then was phenol extracted and ethanol precipitated. 16 
\ig of this DNA was incubated for 30 minutes at 30° with 20 units of Mung Bean nuclease (PL Biochemicals) In 
a volume of 125 pj. The buffer used for the incubation was 30 mM Sodium acetate (pH 4.6), 250 mM NaCI, 1 mM 
65 2nCl2 and 5% glycerol. The reaction was stopped by the addition of sodium dodecyl sulphate to give 0.2% 
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final concentration and extraction with an equal volume of phenol followed by ethanoi precipitation. An aliquot 
of 0.5 j*q of this Mung Bean nuclease treated sample was incubated with 2 units of T4 DNA ilgase for 16 hours 
at 16° and then digested with 2 units of BamHI endonuclease to destroy any vector molecules which had 
escaped the above treatments. This mixture was used to transform competent cells of E. coli K12 strain 
MM294 with selection for amplclllin resistance according to the method of Cohen et al, citicTabove About 
2000 ampiciliin resistant colonies per ml of transformation mixture were recovered and piasmlds were 
prepared by a small scale procedure (according to the method of Birnbolm et al., cited above) from amplified 
cultures of 24 of these transformants. 

16 of 24 plasmids gave two bands of DNA with sizes to 2700 bp (pUC9 vector) and 3000 bp after digestion 
with Hindlll endonuclease. The 3000 bp band is the size expected for the fusion of the HBsAg coding sequence 
and arg3 termination region to the TDH3 promoter fragment. 

Four candidate piasmlds were taken for further study. DNA of each plasmid was digested with Xbal 
endonuclease and labelled by exchange kination with y^P-ATP followed by digestion with Hindlll 
endonuclease and isolation of the labelled 1190 bp Hindlll-Xbal fragment by electrocution and ethirTol 
precipitation following acrylamide gel electrophoresis. The sequence of each fragment was determined by the 
chemical modification methods of Maxam and Gilbert, cited above. Two plasmids were found to have the 
correct sequence ATGGAGAAC for perfect fusion of the TDH3 promoter region to the HBsAg gene One of 
these piasmlds, pRIT12230, was used in further manipulations. Figure 8 is a flow sheet which illustrates 
preparation of pRIT12230. DNA of this plasmid (pRIT12230) was digested with Hindlll endonuclease and the 
3000 bp fragment containing the HBsAg coding sequence fused to the TDH3 promoter was iigated onto 
shuttle vector YEp13. The resulting plasmid, PRIT12265, was transformed into yeast strain DCS (MATa leu2-3 
Ieu2-1 12, his3, can1-1 1) (obtained from J. Broach (State University of N.Y. Stony Brook) and available without 
restriction from the American Type Culture Collection under accession number ATCC 20630) uslnq the 
transformation procedure of Ito et al., J. Bact , 153 , 163 (1983). 

Example 8. Construction of plasmids pRJT12288 f pRIT12322 

The construction on plasmid pRIT12230 (Example 7) contains 128 bp non coding HBV DNA located 3' 
downstream of the stop codon of the HBsAg structural gene. 

To remove the 128 bp of DNA, about 25 \ig of pR!T12230 l prepared as described In Example 7 above is 
digested with Accl and EcoRI endonucleases. pRIT12230 contains a single Accl site located in the S-gene 
coding sequence 7 nucleotides before the TAA stop codon. The digested DNAwas eiectrophoresed on a m 
agarose gel and the larger 4200 bp vector fragment was recovered from the gel by electrocution and ethanoi 
precipitation. 

Artificial DNA linker molecules composed of a 12-mer strand and a 14-mer strand with the following 
sequences are synthesized for insertion between the Accl and EcoR I sites of pRIT12330: 

5' ATACATTTAACG 3' 

12 met 

3 ' TGTAAATTGCTTAA 5 ■ 

14 raer 



This will restore the correct C-terminal HBsAg codons, the TAA stop codon and provide an EcoRI site not 
present elsewhere in the TDH3 promoter of HBsAg coding sequences for addition of transcriptiolTtermination 
fragments. 

100 pmoles of the synthetic 12 mer and 100 pmoles of the synthetic 14 mer single strands are mixed 
together in a final volume of 10-20 uJ, heated at 70° for 15 minutes, and allowed to return slowly to room 
temperature over a period of 3 hr permitting the annealing of the two strands along their complementary 
8 ? q ?™ '1° ! his anneaIed mlxture is added ^out 0.15 pmole (400 ng) of the 4200 bp Accl-EcoRI fragment 
of pRIT12230, 10x concentrated ligation buffer and 2 units of T4 DNA Ilgase. 

This mixture is incubated for 4 hours at 16° before the addition of a further 2 units of T4 DNA ligase and then 
incubated overnight on ice. The Iigated mixture is used to transform competent cells of strain MM294 to 
ampiciliin resistance according to the method of Cohen et al. (Proc. Natl. Acad. Scl. USA 69 2110 1972) 
Plasmids are prepared from 12 or more IndMduaJ colonies by the rapid procedure of Birnbolm and Doiy (Nucl! 
Acids Res., 7, 1513, 1979) and examined by restriction endonuclease analysis. Piasmlds with Insertion of the 
A^cl to EcoRI linker will show a single band of DNA of 4200 bp on digestion with either Accl or EcoRI enzyme 
The correctness of the linker insertion on such a piasmid can be verified by DNA sequencing fromeither the 
IrToo 3 foonr 521 Slte by the chemical modffteatlon method of Maxam and Gilbert (Methods in Enzymology, 

a T ° S!^ d ! a transcr, P tion termination fragment, a plasmid with the Accl-EcoRI linker insert, obtained as 
aescribedabove, is digested with EcoR I endonuclease and treated with alkaJlne^hosphatase (Shine JUS 
Patent 4,264,731). The 2650 bp Hindlll fragment of pRIT10162 t obtained as described In Example 3. was 
recioned in the Hindlll site of the pBR327 vector plasmid to form plasmid pRIT12288. pBR327 is described In 
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Soberon et al. (Gene, 9, 287-305, 1980), and can be prepared as described in Example 10, part b, Infra , from 
plasmid pRIT12309 which can be recovered from the American Type Culture Collection under accession 
number ATCC 67187. Figure 8 is a flow sheet which discloses construction of pRIT12288. ? 
The orientation of the Hind lll fragment on pRIT12288 is such as to Juxtapose the arg 3 transcription 

5 termination region with the EcoR I and Clal restriction sites on the pBR327 moiety. From PRIT12288, a 1 180 bp 
EcoR I fragment is obtained which carries the arg 3 transcription termination region. This fragment is ligated to 
the EcoR I digested, alkaline phosphatase treated pRIT12230 derivative described above, and plasmids issuing 
from the ligation mixture are examined for insertion and the orientation of the 1180 bp EcoR I fragment The 
orientation of the insert can be obtained by digestion with Hind lll endonuclease which will liberate fragments of 

10 2880 and 2720 bp if the insert orientation is as desired. 

A plasmid, pRIT12322 t was constructed in which the Acc l -EcoR I linker replaces the 128 bp of unwanted HBV 
DNA and on which the 1180 bp EcoR I fragment from pRIT12288 was present in the desired orientation, 
downstream of the HBsAg coding sequence. A 2900 bp Hind lll fragment can be excised from pRIT1 2322 which 
carries the expression module for HBsAg synthesis including the TDH3 promoter, the HBsAg coding 

15 sequence and the arg 3 transcription termination region. Figure 8 is a flow sheet which discloses construction 
of pRIT12322. See, also, Figure 11, which is a restriction endonuclease map of pRIT12232 and shows which 
portions thereof come from pRIT12288 and pRlT12230. 

Example 9. Construction of plasmid pRIT12329 
20 As described in Example 8 above, pR!T12322 contains a 2900 bp Hind lll fragment with all the necessary » 

signals for expression of HBsAg from the TDH3 promoter. This module was ligated onto a shuttle vector for 

introduction into yeast cells. 
DNA of shuttle vector YEp13 was digested with Hind lll endonuclease and alkaline phosphatase and used as 

recipient for the introduction of the Hind lll fragment from pRlT12322, prepared as described in Example 8. * 
25 Plasmid pRIT12329 was isolated which consists of the YEp13 replicon together with the 2900 bp Hind lll module 

fragment described in Example 8. 

Example 10. Construction of plasmid vectors for the expression of the hybrid antigen HBsAg-CS in yeast 

30 A. Construction of Plasmids pRIT12573 and pRIT12574 

Plasmid WR201 was obtained from the Waiter Reed Army Institute of Research and results from insertion of 
a 2.3 kilobase (kb) Eco RI fragment from XmPfl (Dame et al., Science, 225, 593-599, 1984) encoding the 
complete circumsporozoite protein gene of Plasmodium falciparum into vector pUC8. Figure 2A shows a 
schematic restriction map and sequencing strategy of clone XmPfl. The positions of restriction enzyme 

35 cleavage sites shown in the figure were determined from the sequence and confirmed by digestion: A, Avail; 
Ac, Accl; B, Bstnl; D, Dral; Dd, Ddel; F, Fokl; N, Ndel; R, Rsal; S, Stul; T, Tthlll; Tq, Taql; X, Xholl. Arrows 
indicate the origin, direction and extent of the sequences determined. The CS protein coding region is shown 
as a heavy line. Figure 3A shows the nucleotide sequence of the CS protein gene from P. falciparum . The 
nucleotide sequence of the CS protein gene in XmPfl is shown. Vector pUC8 is described by Vieira et al. (Gene, 

40 19, 259, 1 982) . pUC8 is commercially available from Amersham and Pharmacia. A 1 21 5 base pair (bp) Stul-Rsal * 
fragment of plasmid WR201 containing the CS protein coding sequence minus the first 52 bp (Figure 12) was 
purified by electroelution from a 7.5<Vb polyacrylamide electrophoresis gel. This fragment was digested with 
Sau3A to generate smaller fragments which included, among others, a 192 bp Sau3A fragment coding for 16 
tetrapeptide repeats of the CS protein and three identical 24 bp Sau3A fragments coding for 2 tetrapeptide ? 

45 repeat of the CS protein. 

Plasmid pRIT10911 (prepared as described in Example 5) is a pUC9 derivative carrying a 3136 bp Hind lll 
cassette containing a 1050 bp Hindlll-BamHl yeast glyceraldehyde-3P-dehydrogenase (TDH3) promoter 
fragment (which supplies the ATG) fused at the BamH I site to a 935 bp BamH I -Hpa l fragment of HBV DNA 
enconding the C-terminal 42 amino acids (aa) of the pre-S2 region, 226 aa of the S gene (serotype adw) and 

50 128 bp of 3' non-coding DNA. The S gene is flanked by a 1150 bp yeast DNA fragment carrying the arg 3 
transcription terminator. pUC9 is described by Vieira et al. (Gene, 19, 259-268, 1982). The Bam HI site of 
pRIT1091 1 , located at the ATG initiation codon, is in phase with the Sau3A sites of the repetitive epitope of CS 
of Plasmodium falciparum . DNA of plasmid pRIT10911 was digested with Bam HI endonuclease, treated with 
alkaiine phosphatase, extracted with phenol and recovered by ethanol precipitation. 0.2 (xg of this DNA was 

55 mixed with 0.2 ^ig of the Sau3A digest of the Stul-Rsal CS gene fragment prepared as described above, treated 
with T4 DNA ligase, and the ligation mixture was used to transform competent cells of E. coli K12 strain MM294 
prepared according to the method of Cohen et al. (Proc. Natl. Acad. Sci., U.SA, 69, 2110, 1972). 32 plasmids 
from individual transformant colonies were prepared by a small scale procedure (Bimboim et al., Nucleic Acids 
Research, 7, 1 513, 1979) after amplification of the plasmid by addition of spectinomycine (300 jig per ml) to the 

60 growing cultures. The recombinant plasmids were analysed on 7.5% acrylamide electrophoresis gel, after 
double digestion with Bam HI and Xba l endonucleases, and were compared to the 219 bp Bam HI -Xba l 
fragment of pRIT10911 (comprising the first 42 amino acid coding sequence of the Pre S2 region precursor 
and the beginning of HBsAg region) and to the fragments of a Hae lll digest of cpx174 DNA. 
Among the 32 plasmids, one recombinant showed a Bam HIXba fragment of 411 bp which corresponds to ? 

65 the insertion of the 192 bp fragment Sau3A in the correct orientation and was designated pRIT12572. Another 
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recombinant showed a BamHI-Xbal fragment of 243 bp corresponding to the Insertion of the 24 bp Sau3A 
fragment in the correct orientation and was designated pRIT12571. Figure 12 Is a flow sheet lllustrltinq 
preparation of pRIT12571 and pRIT12572. 

The Hindlll fragments of pRIT12572 and pRIT12571, containing the TDH3 promoter, the CS-HBsAG hybrid 
coding sequence and the arg3 termination region were recloned onto YEp13, to give, respectively, plasmids 5 
PRIT12574 and pRrT12573 (Figure 12). YEp13 is described by Broach et al. (Gene, 8, 121-133, 1979). Figure 12 
is a flow sheet illustrating preparation of pRIT12573 and pRIT12574. 

Plasmids pRIT10912, pRIT12574 and pRIT12573 were introduced into S. cerevlslae yeast strain DC-5 (a, 
leu2-3, Ieu2-112, his3, can 1-11), and into S. cerevislae yeast strain 10S44C (pep4-3 Ieu2-3, Ieu2-112) obtained 
from M. Crabeel located at the Ceria Institute, Anderlecht, Belgium, (see, also Cabezon, fit al., Proc. Natl. 10 
Acad. Sci. U.S.A., 81, 6694-6598, 1984) by transformation of lithium acetate treated cells (Ito et al., J. Bacterid., 
153, 163-168, 1983) and selection for leucine independence. Yeast strain DCS was deposited in the American 
Type Culture Collection, Rockville, Maryland, U.SA, in accordance with the regulations of the Budapest Treaty 
on June 2, 1982, under accession number ATCC 20630. Yeast strain 10S44C was deposited In the American 
Type Culture Collection, Rockville, Maryland, U.SA, In accordance with the regulations of the Budapest Treaty 15 
on August 18, 1986, under accession number ATCC 20819. The expected hybrid protein for pRIT12573 Is 
predicted to contain 277 amino acids, and for pRfT12574 is predicted to contain 333 amino acids. 

It is also possible to add to or enlarge the CS epitope region on the vector of the Invention. For example, on 
plasmids pRIT12572 and PRIT1257I, additional Sau3A fragments of CS protein DNA, e.g., 192 bp or 24 bp 
Sau3A fragments, can be ligated in to the BamH I site as follows: 20 

Plasmld DNA of pRIT12572 or pRIT12571 Is digested with BamH I endonuclease and treated with alkaline 
phosphatase. The 1215 bp Stul-Rsai fragment of plasmid WR201, obtained as described above, is digested 
with Sau3A endonuclease to liberate the 192 bp and 24 bp fragments bearing the tetrapeptide repeats. These 
fragments are mixed with the BamH I and alkaline phosphatase treated pRlt12572 or pRrT12571 vectors, 
treated with T4 DNA ligase and the mixtures transformed into E. coll K12 cells according to conventional 25 
procedures with selection for ampicillin resistance. Plasmids from transformant colonies are screened for the 
size of the BamH I -Xba l fragments by endonuclease digestion and compared to similarly digested DNA of 
pRlt12572 or pRIT12571, especially the 411 bp and 243 bp BamH I -Xba l fragments from pRIT12572 and 
pRlt12571 which carry the fusion between the CS repeats and part of the pre S2-S coding sequence. Increase 
in the size of these fragments by 192 bp or 24 bp Is indicative of the insertion of a tetrapeptide repeat fragment 30 
at the BamH I site of pRIT12572 and pRIT12571. The presence of a BamH I site on such plasmids is Indicative of 
insertion of the 192 bp or 24 bp Sau3A fragments in the correct orientation for fusion at the ATG codon. This 
process described above may be repeated if desirable to Introduce further 192 bp or 24 bp Sau3A fragments 
encoding the CS tetrapeptide repeats and elongated the CS tetrapeptide region on the hybrid particle. As 
stated before, the Hindlll fragments from such derivatives of pRIT12572 or pRIT12571 carrying the expression 35 
module or cassettes may be excised and inserted on YEp13 or other suitable yeast cloning vector for 
introduction into yeast cells by conventional techniques. 

(B) Construc tion of pRIT12309, pRIT12314 and an expression shuttle vector pRIT12377 conta ining the 
complete 2 micron DNA sequence from yeast" \ 40 

The plasmid pRFT12309 consists of the complete 6318 bp 2 micron DNA sequence from yeast cloned Into 
the EcoR I site of the plasmid vector pBR327. The 6318 bp 2 micron DNA sequence was obtained from plasmid 
pCV19 from J. Broach, Princeton University, NJ. Plasmid pRIT12309 was deposited on August 18, 1986 In the 
American Type Culture Collection, Rockville, Maryland, U.S.A., in E. coll K12 strain MM924 in accordance with 
the regulations of the Budapest Treaty under accession number ATCC 67187. Figure 13 is a restriction 45 
endonuclease map of pRIT12309. Insertion on pBR327 of the 2 micron DNA sequence through one of the two 
EcoRI sites present on this molecule interrupts the 2 micron A coding region and renders the resulting hybrid 
molecule pRrT12309 ineffective for A gene function, also referred to as FLP function. For a description of 2 
micron structure and function see Broach J., "The Molecular Biology of the yeast Saccharomyces - Life cycle 
and Inheritance", pp. 445^470, Strathem J.N., Jones E.W. and Broach J.R. (Eds), Cold Spring Harbor so 
Laboratory, New York (1981). 

PBR327 is described by Soberon et al. (Gene, 9, 287-305, 1980) and was obtained from F. Bolivar 
(Department of Molecular Biology, National University of Mexico, Mexico 20DF). It will be appreciated by one 
skilled in the art that pBR327 can be recovered from plasmid pRIT12309 above by preparing pRn"12309 
plasmid DNA recovered from the E. coll strain deposited in the American Type Culture Collection under the 56 
accession ATCC 67187 by any of a number of conventional methods, digesting this DNA with EcoR I 
endonuclease, iigating the digest and transforming competent cells of E coll K12 to ampicillin or tetracycline 
resistance with the ligation mixture. A substantial number of the transformant colonies will contain plasmids In 
which only the pBR327 moiety of pRIT12309 Is present devoid of either of the two 2 micron DNA fragments. 

The 2850 bp Clal-Sall fragment containing the TDH3-arg 3 expression cassette and flanking pBR322 60 
sequences from pRIT12290 (described in Example 3, above) was recloned onto the pRrT12309 vector between 
the Clal and Sa|l sites. The resulting plasmid, pRrT12314, was digested with Sail endonuclease and ligated with 
the 2218 bpSall-Xhol fragment from YEp13 containing the yeast LEU2 gene. Figure 14 is a flow sheet 
Illustrating preparation of pRIT12314. YEp13 is available from the American Type Culture Collection under 
accession number ATCC 371 15. The ligation mixture was used to transform competent cells of strain JA221 65 



31 



0278 940 

prepared according to the method of Cohen et al (above) to both leucine independence and ampiciliin 
resistance. 

Strain JA221 is publicly available, and was obtained from M. Rosenberg, Smith Kline & French Laboratories. 9 
Inc., Philadelphia and may be obtained from the American Type Culture Collection under accession number 
5 ATCC 33875. 

A plasmid, PRIT12544, was obtained from a colony from this transformation in which the 2218 bp Sall-Xhol 
LEU2 gene fragment was inserted at the Sail site of pRIT12314. The orientation of the insertion is such so as to 
recreate the Sail site on the side closest to the TDH3-arg 3 expression cassette. Plasmid pRIT12544 is an E. 
coji - yeast shuttle vector with single BamH I and Sma l sites located between the TDH3 promoter and arg 5 
10 transcription termination regions which are suitable for insertion of DNA coding sequences to effect their 
expression from the TDH3 promoter. Figure 14 is a flow sheet illustrating preparation of pRIT12544. 

(C) Construction of Plasmid pR[T12658 
The 3328 bp Hind lll fragment of PRIT12574, a YEp13 derivative prepared as described in Example 10, Part A, 

15 was recloned onto a pBR322 A BamH I plasmid (where the BamH I site has been destroyed by filling in with T4 
DNA polymerase) to give pRIT12608 where the BamH I site located at the ATG Initiation codon of the 
CS-HBsAg coding sequence Is unique (Figure 15). The 2550 bp BamH I -SaJ I fragment from pRIT12608 
containing the CSHBsAg coding sequence, the arg 3 transcription termination region and flanking pBR322 
sequence was recloned between the BamH I and Sail sites of pRIT12544 (Figure 15), described above, to give 

20 plasmid pRIT12658 (Figure 15). This exchange placed the CS-HBsAg coding sequences under the control of » 
the TDH3 promoter on a complete 2 micron yeast shuttle vector. Figure 15 is a flow sheet illustrating 
preparation of pRIT12658. 

Example 11. Synthesis of hybrid protein in yeast ? 

25 Yeast ( Saccharomyces cerevisiae ) strains DC5 or 10S44C were transformed according to the method of Ito 
et al. (J. Bacterid., 153, 163-168, 1983) with each of the plasmids pRIT10912 (Example 5), pRIT12573 (Example 
10) ; pRFT12574 (Example 10) or pRIT12658 (Example 10), or plasmids pRIT12329, (Example 9) and pRIT10172 
(Exampfe 2), and were separately grown in liquid medium lacking leucine (YNB or YNB + 80 jig/ml histidine) 
and harvested in mid-log phase. Cells from 1 or 2 ml culture were collected by centrifugation and were 

30 resuspended in 50 pj of 0.125 M tris-HCi (pH6.8) containing 200/o glycerol, 40/0 SDS, 6 M urea and 10% 
Z-mercaptoethanol. After incubation for 5 minutes (min.) at 100°, the sample was divided in halves and both 
were electrophoresed through a 12.5% separating gel, 50/0 stacking gel according to the method of Laemmii 
(Nature, 227, 680, 1971). 

HBsAg and CS protein sequences were identified by protein immunoblotting technique (See, Burnette, 

35 Anal. Biochem., 112, 195-203, 1981; Gershoni and Palade, Anal. Biochem., 131, 1-15, 1983; Towbin and 
Gordon, J. Immunol. Methods, 72, 313-340, 1984). After electro- blotting of the proteins onto a nitrocellulose 
filter (Schieicher and SchQII, 0.45 n) (Towbin et al., Proc. Natl. Acad. ScL, U.SA, 76, 435CM354, 1979) the filter 
was preincubated for one hour at 37° C with 3% gelatin in PBS. Subsequently the filter was washed 5 times 
for 5 min. each time with PBS containing 0.1% Tween 20 (polyoxyethyiene sorbitan monolaurate), and one-half 

40 of the sheet was treated for one hour at room temperature with a mixture of five monoclonal antibodies against ? 
CS protein (Dame et al., Science, 225, 593-599, 1984) in PBS, Wo gelatin. The other half of the sheet was 
treated with a monoclonal antibody against HBsAg, HepYTAS12, in PBS, Wo gelatin. HepYTAS12 is an 
antibody raised against purified yeast derived HBsAg which is directed against a reduction and denaturing 
resistent epitope. The filter sheets were washed 5 times for 5 min. each time with PBS, 0. Wo Tween 20, and a 5 

45 second biotinylated sheep anti-mouse antibody (Amersham) in PBS, Wo gelatin was added for one hour at 
room temperature. The sheets were washed again with PBS, 0.1% Tween 20, 5 times for 5 min. each time, and 
were incubated for 30 min. at room temperature with streptavidin-biotinylated horseradish peroxidase (HRP) 
complex (Amersham). The sheets were washed again 3 times for 5 min. each time with PBS, 0.1% Tween 20, 
and were finally incubated with 30 jxl H2O2 and HRP Color Development Reagent containing 4-chlorol-naphtol 

50 (Bio-Rad), 30 mg in 10 ml methanol, in 50 ml PBS. The molecular weight of the antigens was estimated from 
prestalned protein markers, ovalbumin, alphachymotrypsinogen, beta-Iactoglobin, lysozyme, cytochrome C 
(BRL). 

Both yeast strains DCS and 10S44C, transformed with pRIT12573, showed production of an antigen reacting 
with both anti-CS and anti-HBsAg antibodies, and of an estimated molecular weight of 30,000 kilodaltons (kd). 
55 Both yeast strains DC5 and 10S44C, transformed with either pRIT12574or pRIT12658 showed production of an 
antigen capable of reacting with both anti-CS and anti-HBsAg antibodies, and of an estimated molecular 
weight of 37,000 kd. Some lower molecular weight antigen was also detected, Indicative of some degradation 
of the hybrid protein. Control samples of cells containing plasmids without any CS or HBsAg sequences 
(pRIT10172) did not show any reaction. Control samples of Cells containiong plasmids with only HBsAg 
60 sequences (pRJT12329 or pRIT10912) showed reaction only with the anti-HBsAg antibodies. 

These results show that a hybrid protein of the expected molecular weight containing both CS and HBsAg 
sequences is synthesized in yeast transformed with a plasmid containing the CS coding sequence fused in 
phase to the PreS2-HBsAg coding sequence. 
Furthermore, while the band intensity in the anti-HBsAg reaction was about the same for pRIT12573 and * 
65 pRIT12574, the intensity of the anti-CS reaction was much higher for pRIT12574 than for pRIT12573. This 
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indicates that for the same quantity of HBsAg sequences there are more sequences of the repetitive epitope 
of the CS protein present in pRIT12574 than In pRiT12573. 

Example 12. Synthesis of hybrid particles presenting the CSP epitope 

Yeast ( Saccharomyces cerevisiae ) strains DCS or 10S44C containing either pRrT10172 (Example 2), 5 
PRIT12329 (Example 9), pRIT12573 (Example 10) or pRIT12574 (Example 10), were grown In selective medium 
(200 ml YNB or YNB + 80 jig/ml histidlne) to end of log phase. Cells were collected by centrffugatlon and 
resuspended in 10 ml of 50 mM sodium phosphate buffer (pH 7.4) containing 0.5% Tween 20 (polyoxyethylene 
sor bitan monolaurate), 1 mM PMSF (phenylmethylsulfonyl fluoride) and 2.5Vo Isopropanol. Cells were 
disrupted by passage twice through a French Press at 20,000 psl (1.38 x io 8 Pa). The suspension was 10 
centrifuged for 30 min. at 30 000 xg. The total protein concentration of the supernatant liquid (crude cell 
extract), was measured by the method of Lowry et al. (J. Biol. Chem., 193, 265-275, 1951) with bovine serum 
albumin as the standard. The presence <tf HBsAg was assayed using a radlolmmune assay kit (AUSRIA II) 
commercially available from Abbott Laboratories, North Chicago, Illinois or using an EUSA kit (Enzygnost- 
HBsAg micro) commercially available from Behrlngwerke, Marburg, W. Germany. The availability of CS is 
sequences in the HBsAg particles to an Immunoreactlon was tested in the above-mentioned EUSA assay. 
Appropriate dilutions of samples to be tested (in PBS containing 0.20/o BSA) were allowed to bind (overnight at 
room temperature) to the solid phase anti-HBsAg (Enzygnost kit). The bound HBsAg particles were Incubated 
for one hour at 37°C with an 1/10,000 dilution (In PBS containing 0.1o/o BSA) of a mixture of five monoclonal 
antibodies directed against the CS protein (Dame et al„ Science, 225, 593-599, 1 984) as the first antibody, and 20 
then incubated with an 1/500 dilution (in PBS containing O.io/o BSA) of blotlnylated sheep anti-mouse Ig 
(Amersham) as. the second antibody for one hour at 37° C. Detection was by Incubation with an 1/1000 dilution 
(in PBS containing 0.1o/o PBA) of streptavidinblotinylated horseradish peroxidase complex (Amersham) and 
chromogen from the Enzygnost kit for 30 mln. at 37° C. Between incubations the trays were washed four times 
with washing solution from the Enzygnost kit. Color development was measured at 510 nm in a Trtertek 26 
Multiskan (Dynatech). 

Measurement of protein concentration and AUSRIA activity (Table II) in crude cell extracts showed that 
PRIT12573 directed HBsAg expression at the same level as pRIT12329. However, pRIT12574 showed about a 
ten-fold lower AUSRIA activity. Immunoblot analysis of these extracts showed about an equal amount of 
HBsAg related protein synthesized In those three cases. 30 

Table II - AUSRIA activity in crude cell extracts 



Plasmid Gene Protein HBsAg (RIA) HBsAg 

(mg/ml) (ng/ml) ng/mg 
— protein 

40 



PRIT10172 








5. 


.6 


0 


0 


PRIT12329 


S 






3. 


.3 


5700 


1730 


PRIT12573 


CS( 24 


bp) 


PceS2-S 


3. 


.6 


4000 


1110 


PRIT12574 


CS(192 


bp) 


PreS2-S 


2. 


.8 


450 


160 



45 



The EUSA assay showed that a structure expressing both HBsAg and CS epitopes Is present In pRIT12573 
and pRIT12574 extracts (Table III) and that there are more CS epitopes per HBsAg epitopes that react In 
PRIT12574 extracts than in pRIT12573 extracts. 

To examine the nature of these RIA and ELISA positive structures, one ml of the crude cell extracts was 55 
mixed with 1.5 M CsCI in 50 mM sodium phosphate (pH 7.4) and centrifuged for 40 hours at 40,000xg In a 
Beckman SW50.1 rotor. Collected fractions were assayed for the presence of HBsAg and CS antibody binding 
activity with the RIA and ELISA methods described above. HBsAg and CS antigenic activity was found to 
coequili brate in both pRIT12573 and pRIT12574 gradients. Their buoyant density was found to be slightly 
higher (rho = 1.20 g/cm 3 ) than the one for pRIT12329 (rho - 1.18 g/cm 3 ). 60 

Immunoblot analysis of the gradient fractions confirmed the RIA/ELISA results. HBsAg and/or CS 
sequences were found only in those fractions that did react In the RIA/ELISA assay. 



t 
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Table - II. HBsAa and CS epitopes present in antigen 
bound to immobilized anti-HBsAg antibody 



10 



Crude cell 
extract from: 



Dilution ELISA (HBsAg) EL ISA (CS) 



10S44C (pRIT10172) 



OD e . rt nm 
510 



lOOx 



0.000 



OD_. rt nm 
510 



0.065 



15 



20 



25 



30 



10S44C (pRIT12329) 



10S44C (PRIT12573) 



100X 
200X 
400X 

100X 
200x 
400X 
800X 
1600X 
3200X 



0.580 
0.341 
0.188 

1.298 
1,080 
0.702 
0.336 
0.155 
0.075 



0.054 
0.041 
0.056 

1.596 
1.299 
0.898 
0.560 
0.323 
0.189 



10S44C (pRIT12574) lOOx 0.662 1.678 

35 200X 0.370 1.506 

400X 0.183 1.130 

800X 0.079 0.798 

40 1600X 0.044 0.449 

3200X 0.061 0.205 



45 



These results indicate that the hybrid protein produced by expression of pRIT12573 and pRIT12574 does 
assemble into particles like the 22 nm HBsAg particles upon synthesis in yeast. The particles produced by 
50 expression of pRIT12573 and pRIT12574 do expose the CS sequences on their exterior as shown by their 
capacity to react with anti-CS antibodies. In the case of PRIT12574, those CS sequences partially obstruct the 
accessibility of the a determinant of HBsAg (the one mainly responsible for AUSRIA activity) in the RIA test. 

Example 13. 

55 

A. Purification of hybrid particles 
The crude cell extract of yeast strain 10S44C containing pRm2574 ( prepared as described in Example 10, 

was clarified by polyethylene glycol (PEG) 400 precipitation and concentrated by ultrafiltration in an AMICON 

DC equipped with a cartridge having a cut-off of 100.000 daltons. 
60 Further purification of the hybrid particles was done by methods known in the art, for example, CsCI 

centrifugation, hydroxyapatite chromatography and gel filtration. 
The final purified preparation from ceil extract of 10S44C containing pRIT12574 showed one main peak in the 

exclusion volume of a HPLC TSK-3000 column (LKB) which contained both HBsAg and CS antigenicity. 

SDS-polyacrylamide gel electrophoresis (Laemmli, Nature, 227, 680, 1971) followed by silver staining (Wray et 
65 a!., Anal. Biochem., 118 , 197-203, 1981) showed one major band of estimated molecular weight (MW) 37,000 
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kd. Electron microscopy after negative staining with uranyl acetate showed particles of about the same size 
and shape as yeast derived HBsAg. 

B. Purification of hybrid particles 

Crude yeast extracts are prepared by Dyno Mill disruption of the washed yeast cells In the presence of an 5 
equal weight of 50 mM sodium phosphate extraction buffer (pH 8.1 ) supplemented with 4 mM THriplex III, 1 <Vb 
Tween 20, 4 mM PMSF and 10 Wo isopropanol. 

Cell debris are removed by centrifugation at 1 1 .OOOxg for 90 mln. Five hundred ml centrifuged crude extract 
are adjusted to pH 7.2 with 1 N acetic acid and stirred overnight at 4°C In the presence of 10 gr colloidal Silica 
(Aerosll 380 t Degussa). Subsequently, the suspension is centrifuged at 3,500 g for 1 hour and the pellet is 10 
washed three times with 150 ml of 0.15 M NaCI supplemented with 2 mM PMSF and 5 mM EDTA during 15 min. 
Finally the Aerosil pellet is eluted with 250 ml 10 mM pyrophosphate buffer (pH 9.5) containing 2 <M> Tween 20 at 
37° C for 3 hours. To the desorbate is added dropwlse a 1 M CaCl2 solution to a final concentration of 30 mM 
CaCl2 and the pH is adjusted to 7. The solution is left overnight at 4°C and centrifuged at 6,500g during 15 
minutes.The supernatant is dlalysed against 2 x 51 10 mM Tris/HCI (pH 7.1) at 4°C and subsequently diluted is 
four times with dialysis buffer. 

After addition of 30 mM CaCl2 and 1 M NaCI the diluted antigen solution, adjusted to pH 7.1 , Is passed over a 
150 m! column of Phenyl Sepharose FF, equilibrated In the same CaCl2/NaCI containing Tris buffer, at a flow 
rate of 30 cm/hr. Subsequently the column is washed with equilibration buffer until OD280 nmdecreases to 
zero, and eluted at the same flow rate with 6 M urea in 10 mM Tris/HCI pH9. 20 

Alternatively, the dlalysed and four times diluted antigen solution Is supplemented with 20 Wo (NH4)2S04 (pH 
7) and passed over a 150 ml column of Phenyl Sepharose FF, equilibrated in 10 mM Tris/HCI (pH 7.1) 20 o/b 
(NH 4 )2S04, at a flow rate of 30 cm/hr. After washing the column with equilibration buffer, the column Is eluted 
at the same flow rate with 10 mM Tris/HCI pH 7.1. 

The antigen positive fractions are pooled and finally centrifuged In one or two successive CsCI den sity 25 
gradients at 4°C and 245,000 g for 65 hrs. The purified hybrid Cs-HBsAg particles have a buoyant density of 
1.23gr/crn3. 

Example 14. Immunogenicity of hybrid particles 

Hybrid particles, purified from S. cerevlsiae strain 10S44C containing PRIT12574, prepared by the method of 30 
Example 13, were inoculated into laboratory animals, as such or with aluminum hydroxide as adjuvant (Young 
et al., Science 228, 958-962, 1985). 

C57B1/6 mice (6-8 weeks), guinea pigs, and rabbits were immunized subcutaneously and Intraperitoneal^ 
on days 0, 21 and 49. Animals were bled on days 7, 28, 56 and 84. Blood was allowed to clot at 4°C overnight 
and sera were separated and stored at -70° C until used. 35 

Rabbits, 2 groups of 2 animals each, received 10 \ig of the hybrid CS-HBsAg protein In 1.0 ml doses, with or 
without aluminum hydroxide, at each Immunization. As a control, 2 rabbits were Immunized with 10 pg 
Heptavax B® (Merck Sharpe & Dohme) following the same inoculation scheme. Mice and guinea pigs, in 
groups of five animals, received 1 jig arid 5 \ig hybrid protein respectively, in 0.5 ml doses at each 
immunization. Nonimmunized animals served as controls. 40 

For determination of antibody responses, sera from mice were pooled for each group white guinea pig and 
rabbit sera were assayed Individually. Antl-CS antibody titers were measured by EUSA assay using the 
recombinant CS protein R32tet32 as antigeh (See, Young et al., Science. 228, 958-962, 1985). Anti-HBsAg 
titers were determined by using the AUSAB kit (Abbott Laboratories) and the WHO reference. 

Neither guinea pigs nor mice developed appreciable antibody responses to HBsAg despite repeated 45 
boosting. In constrast, both animal species developed antl-CS antibodies and these responses were 
augmented by boosting. An absorbance of 1 .0 in the ELISA assay was obtained with serum dilutions of 1600 
fold (mice) and 9000 fold (guinea pigs). Immunogenicity of the hybrid particles was not enhanced by 
adsorption to aluminum hydroxide. 

In contrast rabbits developed antibodies to both CS and HBsAg. Reciprocal titers at an absorbance of 1 .0 In 50 
the antl-CS EUSA assay were between 800 and 3200. Anti-HBsAg titers obtained with the hybrid particles 
adsorbed to aluminum hydroxide were between 4000 and 20,000 mUl/ml while the rabbits Immunized with 
Heptavax gave titers of 10* to 10 e mUl/ml. Adsorption of the hybrid particles to aluminum hydroxide enhanced 
the immune response. 

Sera from mice, guinea pigs and rabbits all reacted strongly In the circumsporozolte precipitin reaction as 55 
well as inbibited sporozoite Invasion of hepatoma cells In vitro (Young et al., Science, 228, 958-962, 1985) . (See 
Table IV). Both reactions are indicative of a protective immune response. 
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Table IV. Circumsporozoite precipitin (CSP) reacti - 
vity, and percent inhibition of sporozoite * 
invasion (% ISI) of HepG2-A 16 hepatoma cells 



Animal(s) CSP Reactivity % isi 



10 



15 



1 week 


cost 2nd boost 


f4+ 


2 + 


0) 




Pooled 


mouse sera 


14 


9 


2 


91% 


Rabbit 


#36 


22 


3 


0 


96% 


Guinea 


pig #1107 


24 


1 


0 


100% 



Degree of CSP reactivity is shown in parentheses: 
0 - No CSP reactivity detectable 
2+ - granular precipitate on surface of sporozoite 
4+ - threadlike filament from end of sporozoite 



Example 15. Vaccine Preparation 

30 An illustrative vaccine of the subject invention is prepared as follows: To a buffered, aqueous solution of 30/o 
aluminum hydroxide (10 mM sodium phosphate, 150 mM NaCI, pH 6.8; sterilized by filtration), the particle of 
the invention in similar buffer is added with stirring to a final concentration of 100 jig/ml of polypeptide and 0.5 
mg/ml of aluminum (Al 3+ ). The pH Is maintained at 6.8. The mixture is left overnight at about 0°C. Thimersol is 
added to a final concentration of 0.0050/o. The pH is checked and adjusted, if necessary, to 6.8. 

35 While the above fully describes the invention and ail preferred embodiments thereof, it is to be appreciated 
that the invention is not limited to the embodiments particularly described but rather includes all modifications 
thereof coming within the scope of the following claims. 

EXAMPLES OF Pre S2-S PROTEIN AND PreS1-Pre S2-S PROTEIN PRODUCTION BY YEAST 
40 it has been discovered that the insertion of the Pre S2-S protein coding region of this invention in a yeast ? 
expression vector into a yeast results in the synthesis of particles resembling the authentic 22 nm HBsAg 
particles exposing glycosylated Pre S2 protein on their surface. It has not been previously correctly reported 
that yeast, transformed with any portion of the HBV genome, express a glycosylated product. 
The following examples relate to the expression in yeast of the glycosylated form of a protein containing the * 
45 Pre S2 region in addition to the S protein of Hepatitis B surface antigen. The protein can be extracted and 
purified from yeast cells in the form of particles for use as a human vaccine. The Pre S1-Pre S2-S protein 
expressed in yeast cells is also glycosylated and may also be extracted and purified therefrom in the form of 
particles for use in a vaccine against Hepatitis B virus. 
With respect to the finding of glycosylation, the following should be noted: 
50 1 . Examples 25, 26 and 27 concern concanavalin A binding, tunicamycln effect and sensitivity to Endo 

H, respectively. Each prove that the pre S2-S protein species of 33Kd is glycosylated. Thus, glycosylation 
has been proven by three independent methods. 

2. The specificity of these reagents are, in general terms: 

a) concanavalin A binds preferentially to mannose residues. 
55 b) tunicamycin inhibits all types of giycosyiation that are N-glycosidically bound to asparagine. 

c) endoglycosidase H cleaves asparagine-linked high mannose oligosacchardies leaving one 
N-acetylglucosamine residue attached to the asparagine. 

3. From the combined results of Examples 25, 26, 27 and the above, it is deduced that there is a 
N-linked oligosaccharide chain present on the 33Kd form of the pre S2-S protein (tunicamycin inhibition, 

50 sensitivity to Endo H) which is of the high-mannose type (sensitivity to Endo H). The shift in molecular 

weight observed (about 3000) upon tunicamycin treatment and Endo H digestion, indicates that the 
oligosaccharide structure is composed of about 10 sugar residues. This is about the size of a core 
glycosylation in yeast. Thus, probably only one of the three potential glycosylation sites in the Pre S2-S 
protein is glycosylated. * 

55 4. The purification schemes describeD in Examples 28 and 29 both contain an affinity chromatography 



20 



25 



36 



* 



0 278 940 



step based on sugar residues of a glycoprotein. 

5. Phenyiboranate has a specificity for 1 ,2-cis-diol groups (2 adjacent OH groups sterically positioned In 
» the same direction). 

6. Lentil Sepharose is glucose/mannose specific. 

Example 16. Construction of pRIT12662 - an E. coli vector containing the PreS2-S expression cassette (Figure 
16). 

Plasmid pRIT12290, obtained as described in Example 3, above, was modified by the Insertion of a 
BamH I -EcoR I synthetic adaptor fragment encoding the N-terminal amino acids of the pre S2 region between 
the TDH3 promoter and ARG3 terminator fragments. This was done as follows: 

Piasmid pRIT1 2290 was digested with BamH I and EcoR I enzymes and dephosphorylated with alkaline phos 
phatase. 200 pmoles of the phosphoryiated synthetic adaptor. 

Gin Trp 

BamH I 5 1 GATC CAG TGG 3 1 EcoR I 

3' GTC ACCTTAA 5 1 



were ligated with 0.1 pmoles of the BamH I -EcoR I dephosphorylated plasmid pRIT12290 using 1 unit (U) of T4 20 
ligase and used to transform the E coli strain MM294 to ampicillin resistance. One transformant harboring the 
desired plasmid with the synthetic adaptor inserted between the BamH I and EcoR I sites was identified and 
retained. The plasmid is identified as pRIT12621. Starting with plasmid PRIT12621, the next steps were the 

* elimination of the four extra base pairs (the core of the BamH I site) In order to put the ATG codon in phase with 

the second codon of the Pre S2 region, and the insertion of an EcoR I fragment containing the rest of the pre S2 25 
coding region and the complete S coding region into the EcoR I site of pRIT12621 to complete the entire pre 
S2-S coding region. 

At this stage the DNA manipulations including subsequent linearization, deletion and recircularlzation of the 
plasmid pRIT12621, were realized without amplification by transformation of intermediate plasmlds: the 
procedure was as follows: 30 

30 pmoles (120 jig) of pRIT12621 DNA were linearized with 120 U of BamH I. After elimination of the 
restriction enzyme by phenol extraction, the plasmid was ethanol precipitated and resuspended in the 
appropriate buffer to digest the BamH I single strand extensions with 280 units (U) of mung bean nuclease. The 
nuclease was eliminated by phenol extraction followed by ethanol precipitation. The so treated plasmid was 
resuspended in the ligation buffer and recircularized with 10 units of T4 ligase. 35 

The preparation containing the ligated plasmid was phenolized, ethanol precipitated and resuspended Into 
appropriate buffer and digested with EcoR I enzyme. After elimination of EcoR I endonuclease by phenollzatlon 
and ethanol precipitation, 0.05 pmoles (0.2 *ig) of DNA were resuspended in the ligation buffer In order to be 
ligated with 0.4 pmoles (0.2 jig) of a 838 base pair EcoR I fragment isolated from pRIT12581 that contains the 

* entire C-terminal coding region of the pre S2-S protein. A plasmid essentially similar to pRrT12581 can be 40 
constructed as follows. The vector pUC9 (available commercially from Amersham and Pharmacia) Is digested 

with EcoR I endonuclease and treated with alkaline phosphatase (Shine, U.S. patent 4,264,731). Plasmid 
pRIT10616 (ATCC 39131), is digested with EcoR i and Acci restriction endonucleases and the 826 EcoR I -Acc I 

* fragment containing part of the pre S2-S coding region is recovered by preparative acrylamlde gel 
electrophoresis and electroeiution. About 0.4 pmoles (0.2 jig) of this fragment is mixed with about 10 pmoles 45 
of the following 12 bp synthetic adaptor previously annealed as 



5' 

Acci 



3 1 

described in Example 8, above, and the mixture is treated with T4 DNA ligase and digested with EcoR I 
restriction endonuclease. To the so treated mixture is added about 0.2 ng of the EcoR I digested alkaline 
phosphatase treated pUC9 vector, prepared as described above, and the mixture Is treated with T4 DNA ligase 60 
and used to transform competent cells of E. coli K1 2 to ampicillin resistance by conventional procedures. 
Colonies from the transformant are examined for the presence of a plasmid consisting of the 2650 bp pUC9 
vector together with a 838 bp EcoR I fragment comprised of the 826 bp EcoRI-AccI pre S2-S fragment together 
* with the 12 bp Acc l -EcoR I synthetic linker. The correctness of the construction so identified can be verified by 

DNA sequencing using the chemical modification method of Maxam and Gilbert (Methods in Enzymology, 65, ^ 



lie 

Tyr Stop 50 
ATACATTTAACG 3 1 

EcoR I 

TGTAAATTGCTTAA 5 ' 55 



* 
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499, 1980), preferably from a unique flanking restriction site in the pUC9 polylinker. 

The ligation mixture containing plasmid DNA of pRIT12621, treated as described above, and the 838 bp 
EcoR l fragment of pRIT12581 was used to transform the E. coli strain MM294 according to the method of 
Cohen et al., cited above. One transformant having a plasmid with the EcoR l fragment in correct orientation 
was retained and the plasmid is identified as pRIT12662 (See, Figure 1A and Figure 16). Figure 1A is a flow 
sheet illustrating the preparation of pRIT12662. Figure 16 is a restriction endonuclease map of pRIT12662. The 
pre S2 region from pRiT12662 was sequenced by the chemical modification procedure of Maxam and Gilbert 
(Methods in Enzymology, 65, 499, 1980) to check that the coding reading frame at the N-terminus was as 
follows: 



Met Gin Trp Asn Ser 
AAAC AAAC AAA ATG GAG TGG AAT TCC 
15 pTDH3 pre S2-S 

coding region 

It will be appreciated by one skilled in the art that the vector pRIT1 2662 may also be used for the introduction 
20 of further functional DNA sequences Into the preS2 region by appropriate in vitro manipulation of vector DNA. 
The preS2 coding sequence contains restriction sites for EcoR l, Fst l and BamH I endonucleases. Any of these 
sites may be used for introduction of functional DNA sequences encoding peptides of interest to create in 
phase fusions with the preS2 region. Such fusions will be of the type preS2 - introduced sequence - preS2-S. 
These restriction sites may also be manipulated in vitro to create other vectors and provide other sites or 
reading frames for insertion of functional DNA sequences, such as by the method of Botstein et al. (Science, 
229, 1193-1201, 1985). 



Example 17. Construction of pRIT12377 and of pRIT1 2660 a yeast plasmid expressing the preS2-S protein 

DNA of plasmid pRIT12309, obtained as described in Example 10(B), was digested with Sail endonuclease 
and ligated with a 2218 bp Xho l-Sall fragment of DNA containing the yeast LEU2 gene obtained by digestion of 
YEp13 and purification of the fragment by acrylamide gel electrophoresis. The ilgated mixture was used to 
transform cells of E. coli Ki2 strain JA221 , prepared according to the procedure of Cohen et al., cited above, 
with selection being made for both ampicillin resistance and leucine independence. Strain JA221 was obtained 
from M. Rosenberg, Smith Kline and French Laboratories, Philadelphia, Pennsylvania, U.S.A., and is available 
from the American Type Culture Collection under accession number 33875. A plasmid, PRIT12377, was 
identified from one transformant colony which contains the 2218 bp LEU2 Xho l-Sall fragment inserted at the 
Sail site of pRIT12309. 

The orientation of the fragment on the vector is such as to recreate the Sail site on the Aval side of the 
pBR327 part of the vector. Plasmid pRIT12377 is an E. coli - yeast shuttle vector having the necessary 
functions for selection, replication and maintenance in both organisms. 

The 3050 base pair Hindi 1 1 fragment from pRIT12662 (Example 16), containing the pre S2-S expression 
cassette was purified by acrylamide gel electrophoresis, treated with T4 polymerase and ligated to pRIT12377 
previously opened by BamH I enzyme and T4 polymerase repaired. This preparation was used to transform 
strain E. coli MM294 to ampicillin resistance according to the method of Cohen et al., cited above. An E. coli 
transformant harboring the plasmid pRIT12660 was retained. Figure 17 is a flow sheet illustrating preparation 
of pRIT12660. 

This plasmid, PRIT12660, was used to transform two strains of S. cerevisiae , i.e., strain DCS cir° and strain 
10S44C cir°, according to the method described by Hinnen et al. (Proc. Natl. Acad. Sci., U.SA, 79, 2157-2161, 
1978). 

Both strain DC5 cir° and 10S44C clr° were deposited in the American Type Culture Collection, Rockville, 
Maryland, U.S.A., in accordance with the Budapest Treaty on August 18, 1986 under accession numbers ATCC 
20820 and ATCC 20818, respectively. 

Example 18 . The preS2 region in plasmid pRIT12662. 

The preS2 region in plasmid pRIT12662 was sequenced using the chemical modification method of Maxam 
and Gilbert, Proc. Natl. Acad. Sci., 74, 560-564, 1977) and it was found that there were four base pair changes 
leading to three amino acid changes when compared to another virus of adw2 serotype (Valenzueta et al., 1980 
-Table I page 11). An alanine residue instead of a threonine residue at position -45, a phenylalanine instead of a 
leucine at position -34, and a serine residue instead of an rsoleucine residue at position -11. The last change 
affects a residue that had been found invariable in ail known serotypes until now (Lo et al., Biochem. Biophys. 
Res. Com., 2, 382-388, 1986). 



38 



A * 



0 278 940 



Example 19. Synthesis of particles carrying a receptor for polymerized human serum albumin (pHSA) 

Yeast (S. cerevisiae ) strains DCS cir° of 10S44C cir° containing either pRiT12660 or pRIT12363 (Example 
16) were grown in selective medium [200 ml YNB + 80 jig/ml hlstidine (DCS cir°) or YNB (10S44C cir 0 )] into 
log phase. (pRIT12363 is identical to pRIT12660 except for the hepatitis derived gene; pRIT12363 contains the 
S gene only). Cells were collected by centrifugation and resuspended in SO mM sodium phosphate pH 8.0 5 
containing 0.6% Tween 20 (polyethylene sorbltan monolaurate), 1 mM PMSF (phenyimethylsuifonyl fluoride) 
and 2.5% isopropanol. Cells were disrupted by passaging twice through a French Press at 20,000 psi (1.38 x 
10 8 Pa). The suspension was centrifuged for 30 mln. at 30,000 X g. Total protein concentration In the 
supernatant liquid (crude ceil extract) was measured by the method of Lowry et al., (J. Blot. Chern., 193 , 
265-275, 1951), with bovine serum albumin as standard. The presence of HBsAg related material was assayed 10 
using a radioimmune assay kit (AUSRIA II) commercially available from Abbott Laboratories. The results (Table 
V) indicate that pRiT12660 directs the synthesis of material antigenlcally related to HBsAg particles. The 
expression level, based on AUSRIA activtty, is about the same in yeast strains DC5 cir° and 10S44C cir 0 . 

In a radioimmune assay to detect a receptor for polymerized human serum albumin (pHSA) (Pontisso et al. t 
J. Virol Methods , 6, 151-159, 1983), crude cell extracts from pRIT12660 containing cells gave a positive 15 
reaction while crude cell extracts from pRlfl 2363 containing cells did not. No reaction above background was 
observed with polymerized bovine serum albumin. 

CsCI equilibrium centrifugation of crude cell extracts from pRIT12660 containing cells showed the AUSRIA 
positive material to band around rho -1.20 g/cm 3 as do serum derived or yeast derived HBsAg particles. 
Material recovered from this portion of the gradient gives a positive result In the pHSA test. 20 

The aforementioned results show that in pRIT12660 containing yeast cells A HBsAg related protein is 
expressed which is assembled into a structure like that of 22 nm particles from serum and exposing a receptor 
for pHSA on its surface. 

25 

Table VA. AUSRIA activity in crude cell extracts 



Strain Plasmid Protein HBsAg related HBsAg related 

mg/ml antigenicity antigenicity 
ng/ml ng/mg protein 



30 
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DC5 cir° pRIT12660 13.4 

DC5 cir° pRIT12660 12.6 

10S44C Cir° pRIT12660 10.8 

10S44C cir° PRIT12660 9.5 



7 
7 
6 
6 



522 
555 
556 
631 
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Example 20. Four S-related protein species are found to be expressed in yeast transformed with pRiT12660. 

To analyze the HBsAg related protein monomers expressed in pRIT12660 containing cells, cells were grown 50 
in selective medium, either in Erlenmeyer flasks or In fermenters, into log phase and collected by 
centrifugation. 

To the packed cells (0.1 gram), first 20 pJ 40 mM PMSF In isopropanol, and then 200 pJ sample buffer for 
SDS-polyacrylamide gel electrophoresis (0.125 M tris-HCI, phi 6.8 containing 20P/b glycerol, 4% SDS, 6 M urea 
and 100/o 2-mercapto-ethanol), were added. Samples were vortexed and allquots of 2 to 20 nJ were further 55 
diluted in sample buffer to a final volume of 40 jxl, incubated for 5 minutes at 1 00° and electrophoresed through 
a 12.5% separating, 5% stacking gel according to the method of Laemmii (Nature, 227, 680, 1971). 

After electroblotting (Towbin et al., Proc. Natl. Acad. Scl., U.S.A., 76, 350-4354, 1979) of the proteins onto a 
nitrocellulose filter (0.45 pjn, BioRad), the filter was prelncubated for one hour at 37° C with 3% gelatin in PBS. 
The filter was incubated with a monoclonal antibody directed against a denaturation and reduction resistant 60 
epitope of human derived HBsAg (monoclonal 6 obtained from H. Thomas, Royal Free Hospital, London, 
England) and antibody binding was detected by successive incubation with biotinylated sheep anti-mouse Ig 
(RPN 1021, Amersham, dilution 1/250 in PBS containing Wo gelatin), steptavidln-blotlnylated horseradish 
peroxidase complex (RPN 1051 , Amersham, dilution 1/400 in PBS containing 1<Yo gelatin) and finally 30 |xl H2O2 
and HRP Color Development reagent containing 4-ch!oro-1-naphtol (BioRad), 30 mg in 10 ml methanol, in 50 65 
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ml PBS. Between incubations the filter was washed with PBS containing 0.1 % Tween 20. 

The immunoblot showed 4 protein bands related to the S protein (p23) of estimated molecular weights of 
about 33Kd, 30Kd, 28Kd and 25Kd which are detected by the monoclonal antibody. The majority of S related 5 
material Is found in the 33K protein band and was found to migrate slightly faster than the gp33 protein 
5 described by Stibbe and Gerlich (Virology, 123 , 436-442, 1982). The smallest protein band detected migrates 
slower than the S protein (p23) synthesized in yeast. 

Example 21. The four S-related protein species (of 33Kd, 30Kd, 28Kd and 25Kd) are assembled in particles. 
Crude ceil extracts of pRlT12660 (Example 16) containing yeast cells were prepared as described in 
10 Exampled 1 9 or by grinding yeast cells in a Dynomill in the presence of an equal weight of extraction buffer (200 

mM trfs, 3 M NaCI, 10 mM EDTA, 10 mM ethylene glycol bis (beta-aminoethyl ether) N,N,N',N'-tetraacetic acid 

(EGTA), 4 mM PMSF, 10°/b isopropanol) and subsequent centrifugation for 46 minutes at 30,000 X g. 
The pRIT12660 directed particles were partially purified from the crude cell extract by ultrafiltration, CsCI 

equilibrium centrifugation and hydroxyapatite chromatography which are methods known in the art. The four 
15 protein bands of S-related material detected by Western blot analysis copurify during these purification steps. 

An immuno-blot analysis (as described in Example 20) of the AUSRIA positive CsCI fractions after equilibrium 

centrifugation showed that the four S-related bands were present in the same relative proportions in each 

fraction. 

The material is thus distributed in homogenous fashion throughout the peak fractions of the gradient. 

20 9 
Example 22. The 33Kd and 30Kd proteins carry a receptor for polymerised human serum albumin. 

Human serum albumin (Sigma, A 8763) was polymerised by glutaraldehyde treatment (Merck, A-12179) and 
purified according to Pontisso et at (J. Virol, methods, 6 151-159, 1973). Reactivity of the four S-related 
proteins towards pHSA was tested in a Western blot procedure. Partially purified preparations of pRIT12660 * 

25 directed particles (Example 21) were separated by SDS polyacrylamide gel electrophoresis and transferred to 
a nitrocellulose membrane as described in Example 20. The nitrocellulose filter was successively incubated 
with pHSA (15 iig/ml in PBS containing 1% gelatine), rabbit anti-serum to human albumin (Behringwerke, 
ORCB 04/05, 1/125 dilution in PBS containing 1% gelatin), biotinylated donkey anti-rabbit Ig (Amersham RPN 
1004, 1/250 dilution in PBS containing 1% gelatin), streptavidin-biotinylated horseradish peroxidase 

30 (Amersham RPN 1051 1/400 dilution in PBS containing 1% gelatin) and finally with H2O2 and 
4-chIoro-1-naphtol as described in Example 20. 

The two largest S-reiated protein bands (the 33Kd and 30Kd) reacted with pHSA, the two smallest ones 
(28Kd and 25Kd) gave no reaction. 

35 Example 23. The 33Kd and 30Kd proteins contain a sequence specific for the preS2 region of the preS2-S 
protein 

A peptide corresponding in 23 of 26 positions to the 26 N-terminal amino acids of the preS2-S protein was 
synthesized (NH2-Met-Glu-Trp-Asn-Ser-Thr-AIa-Phe-His^ 

Leu-Pro-AIa-Gly-Cys-COOH) and coupled via the C-terminal to keyhole Limpet hemocyanin (Peninsula 

40 Laboratories). Rabbits were immunized subcutaneously with 100 ^g of the conjugate (which corresponds to > 
20 [ig peptide) in Freund's complete adjuvant, boosted at days 8 and 15 with another 100 jig conjugate in 
Freund's complete adjuvant and at days 28, 69 and 149 with 100 jig conjugate in PBS. Development of antibody 
titer was followed by taking blood samples at regular intervals and testing serum dilutions by incubation on 
solid-phase fixed synthetic peptide. Bound antibodies were detected by iodinated protein A (New England * 

45 Nuclear). For immunoblot applications, sera with antipeptide titers greater or equal to 1/5120 were selected. 
The serum was used as such after a 100 fold dilution or after partial purification of IgG. IgG was partially purified 
by precipitation with Na2S04 (120 mg per ml of serum), resuspension in PBS and reprecipitatlon with 140/0 
Na2S04. The resuspended IgG solution was desalted by passage on a PD10 column (Pharmacia). 
Detection of reactivity of these antibodies towards the S-related proteins in immunoblots was by 

50 biotinylated donkey anti-rabbit Ig, streptavidin-biotinylated horseradish peroxidase and H2O2 and 4-chloro- 
1-naphtol as described in Example 22. 

Proteins from partially purified preparations of pRIT12660 directed particles (Example 21) were separated by 
SDS polyacrylamide gel electrophoresis and transferred to nitrocellulose as described in Example 20. Also run 
on the gel was S protein (p23) from particles purified from yeast harboring pRIT12363. The immunoblot 

55 showed that the two largest S-related protein bands (the 33Kd and 30Kd) react with the anti-peptide 
antibodies while the two smallest ones (28Kd and 25Kd) give no reaction. The S protein (p23) gives no reaction 
with the anti-peptide antibodies. 

Example 24 Human anti-Hepatitis B antibodies from naturally infected persons recognize epitope(s) on the 

60 33Kd and 30Kd preS2-S proteins not present on the S protein. 

Reactivity of pooled gamma globulin from anti-hepatitis B positive human subjects (Belgian Red Cross, 
100-150 lU/ml Lot 85A08) towards the four S-related proteins was tested by the immunoblot procedure 
described in Example 20. A partially purified preparation (Example 21) of pRIT12660 directed particles was 
dissociated by SDS and the proteins separated by SDS polyacrylamide gel electrophoresis. After transfer of » 

65 the proteins to nitrocellulose, the membrane was incubated with a ten-fold dilution of anti-Hepatitis B gamm 
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globulin in PBS containing Wo gelatine.. Antibody binding was detected by successive incubation with 
biotinylated sheep anti-human Ig (Amersham RPN1003, 1/250 dilution in PBS containing 10/ 0 gelatin) 
steptavidin-biotinylated horseradish peroxidase and finally with H 2 0 2 and 4-chIoro-1-naphtoi as described In 
Example 20. 

The two largest S-related protein bands (the 33Kd and 30Kd band) react with the human antibodies the two s 
smallest ones (the 28Kd and the 25Kd band) give no reaction. The S protein (p23) derived from particles from 
yeast cells containing pRIT12363 does not react with these human antibodies. These results show that gamma 
giobulln obtained from infected persons specifically recognizes epitopes present on denatured and reduced 
preS2-S proteins which are absent on denatured and reduced S protein. It is known that the epitopes present 
on S protein derived from human serum and yeast are predominantly conformational. 10 

Example 10A, The 33Kd protein species carries a mannosyl containing oligosaccharide structure. 

PreS2-S protein from partially purified preparations of particles from pRIT12660 containing 10S44C cir° cells 
was eiectrophoretically separated and transferred to nitrocellulose as described in Example 20 Also run on 
the gel were S protein from particles purified from yeast containing pRIT12363 and HBsAg particles from 15 
infected human serum obtained from Dr. W.H. Gerlich, Department of Medical Microbiology, University of 
Gottingen, Federal Republic of Germany. 

One half of the nitrocellulose filter was incubated in 60 ug/ml concanavalin A (Caiblochem A grade) in 10 mM 
trls pH 7,5, 150 mM NaCI 1 mM CaCI 2 1 mM MnCI 2 0.05tVo Tween 20 and subsequently with 30 iig/ml 
horseradish peroxidase (Sigma type VI P 8375) in 10 mM tris pH 7.5 150 mM NaCI O.OSQfo Tween 20 and finally so 
H2O2 and 4-chloro-1-naphtoi as described in Example 20. 

Between incubations the filter was washed with 10 mM tris pH 7.5 150 mM NaCI 0.060A) Tween 20 (Hawkes 
Anal Biochem. 123, 143-146, 1982; Clegg, Anal. Biochem.,127, 389-394. 1982). The other haif of the 
nitrocellulose filter was incubated with monoclonal antibody 6 and treated as described in Example 20. 

The 33K protein band reacts with the concanavalin A-peroxidase reagents while none of the 30Kd 28Kd 25 
25Kd protein bands or the S protein derived from pRIT12363 containing cell reacts. In the presence of 0 1M 
methyl alpha-D-mannopyranoside the binding of concanavalin A to the 33K protein Is abolished. 

Example 26. An oligosaccharide structure is N-glycosidicaily bound to asparaglne 

PRIT12660 and pRIT12377 (Example 17) containing cells were grown In selective medium (YNB + 80 ug/ml 30 
histidine (DCS cir°) or YNB (10S44C Cir°)l to OD 62 0nm= 0.2. Tunicamycin (Caiblochem) was added to a final 
concentration of 10 ug/ml and the cultures incubated for 30 min at 30°C. Then 20 ^Ci ^methionine (1000 
Ci/mmoi) (New England Nuclear) per ml was added and the cultures were incubated another 15 minutes 2 ml 
of labeled cell culture, either treated or untreated with tunicamycin, were centrtfuged In a microfuge and the 
packed ceils were resuspended in 40 pJ Wo SDS and disrupted by agitation with a Vortex mixer for 2 min. in the 
presence of 0.3 gram glass beads (diameter -0.45- 0.50 mm). 

Immune precipitation was essentially according to Julius et aUCell, 36, 309-318, 1984). The lysate was 
heated for 3 minutes (min.) at 100°, diluted with 0.5 ml PBS containing 10/0 Triton X100, 0.5<Vb sodium 
deoxychoiate 0.1o/o SDS and heated again for 1 min. at 100°. 

25 pj of a 100/0 suspension of prewashed Immuno-precipitln (Formalin-fixed Staph A cells, Bethesda 40 
Research Laboratories) was added to the boiled extract and Incubated at 0° for 30 min. The mixture was 
clarified by centrifugation for 5 min. in a microfuge. To the supernatant liquid, 2.5 jil monoclonal antibody 6 
specific for S protein (see Example 20) was added, and the mixture was incubated for 1 hour at 0°. Another 
portion (25 of Immuno-preclpitin was then added and incubated for 30 min. at 0°. The immune complexes 
were collected by centrifugation (5 min. In a micro fuge) and washed with PBS containing 2M urea PBS 45 
containing Wo 2-mercaptoethanoi and finally PBS. The final washed pellets were resuspended in sample 
buffer for electrophoresis as in Example 20. 

Samples containing 10* cpm were electrophoresed through a 12.5<>/o SDS polyacrylamide gel (see Example 
20). After electrophoresis, gels were fixed in 40<Vb methanol 100/0 acetic acid, treated with Amplify (Amersham) 
for fluorography, dried on filter paper under vacuum and subjected to fluorography using Kodak X-Omat S film 50 
at ~70°. 

This showed that pRIT12660 containing cells synthesize a 33Kd protein In the absence of tunicamycin and a 
%^fJH ein in the P resence of tunicamycin. Both bands are absent In the corresponding samples of 
PRIT12377 containing cells. H 

This results shows that the 33Kd preS2-S protein expressed in yeast strains DC5 cir° carries an 55 
oligosaccharide portion N-giycosydically bound to asparagine. There is most probably only one 
oligosaccharide chain attached to the 33Kd form of the preS2-S protein which is not much larger than a core 
giycosyiation. * 
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Example 27. The oligosaccharide structure is of the high-mannose type. 

Crude cell extracts from pRIT12660 containing cells or partially purified preparations of preS2-S containing 
particles from these extracts (Example 21) were treated with Endo-N-acetylglucosamlnidase (Endo H.« 
EC.3.2.1.96) from Streptomycesplicatus obtained from New England Nuclear. 

Partially purified particles (50 pJ. 450 jig/ml protein) were boiled for 3 minutes at 100° in the presence of 
0.10/0 sodium dodecyl sulphate, diluted ten-fold with 100 mM sodium-citrate pH5 and to 240 \s\ of this mixture 65 
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2.4 nl Endo H (74 jig/ml) was added. Samples with and without added Endo H were incubated for 4 hrs at 37°. 
Analysis of the Endo H treated and untreated samples by SDS-poIyacrylamide gel electrophoresis and 
immunoblotting with detection by the monoclonal antibody 6 directed against the S protein as described in * 
Example 20, showed the disappearance of the 33Kd band and the appearance of a 31 Kd band upon Endo H 
5 treatment. The 31 Kd band still reacts with the concanavalin A-peroxidase reagents as described above. The 
position of the other bands (30Kd, 28Kd and 25Kd) did not shift upon Endo H treatment. Longer Incubation 
times or higher Endo H concentrations gave identical results. The result shows that the oligosaccharide 
structure linked to the 33Kd protein is of the high-mannose type and has about the size of a core glycosylation. 

10 Example. 28 Purification of preS2-S protein containing particles. 

Crude yeast extract containing preS2-S particles is prepared by grinding the washed yeast cells in a 
Dynomili in the presence of an equal weight of extraction buffer, either 50 mM sodium phosphate pH 8.1 1 4 mM 
EDTA 1 .OO/o Tween 20, 4 mM PMSF and 100/o isopropanol or 200 mM tris-HCI pH 9.0, 3.0 M NaCI, 10 mM EDTA, 
10 mM EQTA (ethylene glycol bis (beta)aminoethyl ether N,N,N\N'-tetra acetic acid), Wo Tween 20, 4 mM 
15 PMSF and 10% isopropanol. 

The homogenate is adjusted to pH 8.0 or 9.0 respectively and subsequently centrifuged for 45 minutes at 
30,000 X g. 

To 500 ml of the crude yeast extract, adjusted to pH 7.2, is added 10 gr colloidal Silica (Aerosil 380 Degussa). 
The colloidal suspension is stirred over night at 4° and subsequently centrifuged at low speed. The Aerosil 

20 pellet is washed three times with 150 ml 0.15 M NaCI for 1/4 hour and then eluted batchwise with 250 ml 10 mM * 
pyrophosphate buffer (pH 9.3) containing 2°/o Tween-20 at 37° for 3 hours. The desorbate, a yellowish, lightly 
opalescent solution, is passed over a 50 ml column of phenylboronate Agarose (Amicon) equilibrated in 10 mM 
pyrophosphate buffer pH 9.0. The flow rate is 15 ml/hour. 
The column is further washed with 2 column volumes of equilibration buffer, and is then eluted in the * 

25 reversed direction with 100 mM sorbitol in 50 mM Tris (pH 7.2) at a flow rate of 15 ml/hour, Eluted peak 
fractions are pooled and after adjustment to pH 8.1 , passed over a DEAE-Fractogei TSK column equilibrated in 
50 mM Tris pH 8.1. The particle is eluted in the same buffer containing 75 mM NaCI. After this step residual 
nucleic acids were removed (<1 (ig/20 jig protein). The balance sheet is shown in Table VI. 

30 
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Table V. Two step purification of preS2-S containing 
particles 



mg S-related m g 
protein mg mg poly- 

ffil (SIA) protein lipids saccharide 



Fraction 



10 



Crude 
extract 



500 



11.3 14,600 



5,286 



7.850 



15 



Desorbate 270 
aerosil 



5.96 



385 



554 



249 



20 



Flow 

through 
PBA-coluran 



315 



2.83* 



365 



360 



267 



25 



Eluate 
PBA-column 



70 



2.8 



2.9 



2.5 



30 



*not sufficiently glycosylated to be retained on the 35 
column. 

Material purified from extracts of cells containing pRIT12660 as described in this Example 28, followed by a 40 
CsCI equilibrium centrifugation, was prepared for electron microscope examination after staining wfth uranyl 
acetate. Examination showed the presence of discrete particle structures about 19 microns In diameter, and 
that such hybrid particles contained between 5 to 20 jtg Tween 20 per 100 ng protein. 

Example 29. Affinity chromatography on lentil sepharose. 45 
In this example, the phenylboronate agarose chromatography step is replaced by affinity chromatography 

on lentil sepharose (Pharmacia). 
50 ml of Aerosil-desorbate, obtained as described in Example 28, was passed over a 5 ml column of lentil 

Agarose equilibrated in equilibration buffer such as Tris buffer 10 mM (pH 7.2), 0.14 M NaCI and 0.3<Vo Tween. 

The column was washed with 2 column volumes of equilibration buffer such as Tris Buffer 10 mM (pH 7.2) and 50 

the eluted with sugar additive with specificity for lentil such as 0.5 M aipha-methylmannopyranoside In 

equilibration buffer. The balance sheet is shown in Table VII. 
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Table VII, Application on lentil Sepharose affinity 
chromatography in the purification on 
preS2-S containing particles > 



Volume S- re la ted Total 

Fraction (ml) protein (RIA) Protein 

Crude 100 2170 3 gr 

Desorbate aerosil 50 1700 138 mg 

Flow through 50 428 
Lentil-sepharose 

Elua.te 12.5 925 1.26 mg 
Lentil-sepharose 



Examination revealed that the hybrid particles purified as described herein contained between 5 to 50 jag 
Tween 20 per 100 ng protein. 

Example 30. Immunogenicity of the preS2-S particle. 

The immunogenicity of particles prepared from extracts of cells containing pR!T12660 (Example 16) was 
studied in 2 strains of mice: Balb/c (hb-d) and NMRI (H2-Q). The particles were absorbed on AI(OH)3 before 
injection and the mice (5 mice/group) were injected intraperitoneally (LP.) with 0.3 |ig of RIA reacting material. 
One month after the immunization, the mice were bled and the sera from the mice of the same group were 
pooled. The titer of anti-S antibodies was determined in the pools by using the AUSAB kit (Abbott Labs) and 
the WHO reference. The titers were converted in mlU/ml by using the Holiinger formula (Hollinger et al M in 
Szmuness et al., eds. Viral Hepatitis, Philadelphia : Franklin Institute Press, 451-466, 1982). The anti-preS2 
antibodies were determined using a solid phase assay (RIA) in which the synthetic preS2 peptide given in 
Example 23, is absorbed on plastic and bound antibodies were detected with a goat anti-mouse IgG labelled 
with 12S I. The results summarized in Table VIII show that the particles from pRIT12660 containing cells induce 
both anti-S and anti-preS2 peptide antibodies in both strains of mice with higher titers of anti-preS2 peptide 
antibodies induced in NMRI than in Balb/c mice. The anti-S titer obtained In Balb/c or NMRI mice is 2 to 3 times 
higher than the titer obtained with a higher dose of particles from pRIT123§3 containing cells which do not 
contain the preS2 sequence. PreS determinants enhance the response to the S determinants. This enhancing 
effect is not shown if the preS2 synthetic peptide is injected together with particles which lack the preS2. 
Moreover, the preS2 synthetic peptide alone is unable to elicit anti-preS2 antibodies. 

This indicates that the association of preS2 determinants and S determinants on the same particle is 
required to obtain an anti-preS2 response. Furthermore this association results in an improved anti-S 
response. 

The results of the immunogenicity studies referred to herein related to the PreS2-S particle are summarized 
in Table VIII. 
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Table VIII » Immunoaenicity of the preS2-S particle 







Titre of 






Preparation Dose of S 


Anti-S antibodies 


Anti-preS2 


5 


injected related 


(mIU/ml)(2) 


antibodies (3) 




antigen 
(vg)(D 








DdlD/C NHR1 


Balb/c NMR1 


10 


Yeast-derived 
S particle 


1.7 


1404 719 


0 z 


15 


directed by 










PRIT12363 








20 


Yeast-derived 


1.55 


Ixo3 


0 4 




S particle 
directed by 








25 


pRIT12363 










+ synthetic 








30 


preS2 peptide: 










0.08 »g 








35 


Synthetic 




4 1 


0 4 




preS2 peptide: 










0.08 »g 








40 


Yeast-derived 


0.3 


3344 2418 


8 120 




(preS2-S) 








45 


particle 










directed by 










PRIT12660 








50 



55 



60 



(1) determined by using the AUSRIA Kit (Abbott Labs) and the Proposed International Reference 
Preparation supplied by Dr. Schild, National Institute of Biological Standardization, London, as standard. 55 
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(2) determined by using the AUSAB kit (Abbott Labs) and the World Health Organization standard. 

(3) determined by using a solid phase radio immuno assay and synthetic preS2 peptide adsorbed to 

plastic. ? 
The titer is the dilution which gives a significant binding of the antibody (2.5 times the value obtained with a 
5 negative control serum). 

EXAMPLES OF THE EXPRESSION OF THE ENTIRE PreS1 -PreS2-S PROTEIN CODING SEQUENCE IN YEAST 



Example 31. Construction of plasmld pRIT12845 that expresses the preS1-preS2-protein in yeast 
10 The first step was the fusion of the TDH3 promoter region with the pre S1 N-termina! region from the HBV 
genome. The starting material was plasmld pRIT12792 that harbors a 495 bp Ncol-Xbal fragment containing 
the pre S1-Pre S2 coding sequence except for the last asparagine codon. The Ncol site overlaps the ATQ 
codon of amino acid residue -163 of the HBV pre S1 sequence. The sequence of the Nco l-Xbal fragment is 
shown in Figure 4A. 

15 An essentially similar plasmid to PRIT12792, identified as pRIT12793 (5940 bp), was deposited in the 
American Type Culture Collection, Rockvilie, Maryland, U.S.A., in accordance with the regulations of the 
Budapest Treaty under accession number ATCC 67193 on September 5, 1986. From plasmid PRIT12793, a 495 
bp Nco l -Xba l fragment can be recovered which contains a complete pre S1-pre S2 coding sequence. This 
fragment differs from that shown in Figure 4A by one nucleotide having the sequence ACGAACTAATAATC- 

20 TAGA at the 3' end of the fragment. The nucleotide difference is underscored. In the following examples, 
pRIT12792 can be replaced with pRIT12793. Both the Ncol-Xbal fragments contained pRIT12792 and 
pRITI 2793 were derived by in vitro manipulation of DNAfrom the PreS1-PreS2 region of the DNA of Hepatitis B 
virus strain (serotype adw) cloned on plasmid pRiT10616. Plasmid pRIT10616 is available from the American 
Type Culture Collection under accession number ATCC 39131. 

25 52 \lq of pRIT12792 were digested with the Nco l and Xba l restriction enzymes and treated with Mung Bean 
nuclease in order to eliminate the 5' extensions. 600 ng (2 pmol) of the treated and purified 495 bp Nco l Xba l 
fragment were ligated to 170 ng (0.02 pmol) of pRIT12314 vector that had been previously opened by BamH I 
and Smal enzymes and also was treated with mung bean nuclease. pRIT12314 contains the entire 2 micron 
fragment from S. cerevisiae and an expression cassette in which the TDH3 promoter is separated from the 

30 ARG3 terminator by a BamHI-Sma l -EcoR I linker: 



35 



BamHI EcoRI 



Smal 



40 

ATGGATCCCCGGGAATTC .... 

pTDH 3 ' tARG 3 



A more detailed description of the construction of pRIT12314 is found in Example 10(B) above. The ligation 
mix described above was used to transform E. coji MM294 to ampicillin resistance according to the method of 
Cohen et al., cited above. Six transformants containing a plasmid in which the pre S1-pre S2 fragment was 
Inserted in the correct orientation were retained. These were the plasmids pRIT12843 (a.....f) (Figure 18). 

50 The second step was the insertion into each plasmid pRIT12843 (a f) described above, of the S coding 

region to reconstitute the entire pre S1-S2-S gene. 

This was done as follows: plasmids pRIT12843 (a.....f) were each digested with BamH I and Sail enzymes. The 
BamH I site is placed at the beginning of the pre S2 region, the SaJI site is placed behind the ARG3 terminator in 
pBR 327. 80 ng (0.012 pmol) of the largest Bam H I -Sal I fragment (10,400 bp) purified from each plasmld 

55 pRIT12843 (a f) were ligated to 300 ng (0.11 pmol) of the small BamH I -Sal l fragment from pRIT12660 

(Example 16). The small (4800 bp) BamH I -Sal l fragment of pRIT12660 contains part of the pre S2-S coding 
sequence followed by the ARG terminator sequences and the LEU2 yeast sequence. After ligation and 
transformation of E. co|i MM294 with each of plasmids pRIT12843 (a.....f) according to the method of Cohen et 
al., cited above, a series of ampicillin resistant clones containing the plasmids named pRIT12845 (a.....f) was 

GO obtained. Figure 18 is a flow sheet illustrating preparation of pRIT12845 (a.....f). These plasmids pRIT12845 
(a.....f), were used to individually transform the S. cerevisiae strain 10S44C cir° according to the method of Ito 
et ai. t cited above. The screening of cultures grown from individual clones from the yeast transformations was 
done by testing crude cells extracts in the commercially available AUSRIA test (Abbott). Crude cell extracts 
were made by disrupting yeast cells resuspended in PBS, containing 0.5% Tween 20, 2 mM PMSF and 5<>/o 

OS isopropanol. One transformant of each of the six separate transformations [with pRIT12845 (a.....f)] was 
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tested. Five out of six tested transfomrants gave a positive AUSRIA. 

To test whether a protein of the correct size gave the S-related antigenicity detected by the AUSRIA, sam 
pies of the crude extracts described above, were subjected to the immunoblot procedure described in 
example 20. Four of the five AUSRIA positive transformants showed a S-related protein of about 39K. One 
transformant (extract a) showed a S related protein of 23K. Retained as pRIT1 2845 was the yeast tranf ormant 5 
expressing the 39K protein present in extract (e). See, Figure 18. 

Example 32. The preS1-preS2-S protein is assembled into particles carrying a receptor for polymerized human 
serum albumin and a preS1 epitope on their surface. 

Crude cell extracts from yeast strain 10S44C cir° harboring either pRIT12845, pRIT12660 or pRIT12377 and w 
yeast strain DCS cir° harboring pRIT12363 were prepared as described in Example 19. Total protein 
concentration, HBsAg related antigenicity and detection of a receptor for polymerized human serum albumin 
were performed as described in Example 19. The result showed a strong binding activity for polymerised 
human serum albumin present in extracts from strain 10S44C cir° containing pRiT12843 and pRIT12660 (see 
also Example 10) which is absent in extracts from strain 10S44C clr° harboring pRIT12377 or strain DCS cir° 15 
harboring pRIT12363. 

CsCI equilibrium centrifugation of crude cell extracts from 10S44C clr° cells containing pRIT1284S 
(described in Example 12) and measurement of HBsAg related antigenicity by AUSRIA assay in the CsCI 
fractions, showed the antigen to band around rho - 1 .2 g/cm 3 . This was confirmed by immunoblot analysis of 
the CsCI fractions. Pre St antibody binding activity In the CsCI fractions was tested by the ELISA assay 20 
described in Example 12. After allowing the antigen to bind to the solid-phase antl-HBsAg, the bound antigen 
was incubated with monoclonal antibody MA18/7 (Example 31) which was then detected by blotinylated 
anti-mouse Ig, streptavidin-biotinylated horseradish peroxidase com- plex and chromogen as described In 
Example 12. 

The pre S1 specific antibody binding activity was found to coequilibrate with the HBsAg related antigenicity 25 
In the CsCI gradient. These results show that the pre S1-pre S2-S protein produced in 10S44C cir° cells 
harboring pRIT 12845 is assembled into particles like the 22 nm HBsAg particles derived from serum. The 
particles expose pre S1 specific sequences as well as a receptor for polymerised human serum albumin on 
their surface. From the level of activity of those particles in the AUSRIA test and the Intensity of the reaction of 
the pre S1-pre S2-S protein in an immunoblot analysis, it can be concluded that the particle's S determinants 30 
are partially obstructed or deformed by the presence of pre S sequences In the particle. 

Example 33. Two proteins of molecular weight of about 38 and 45 kilodaltons (kD), both carrying preS1 , preS2 
and S epitopes are expressed in yeast transformed with RIT12845 

HBsAg related protein monomers expressed in pRIT12845 containing cells were analysed as described in 35 
Example 20. 

Migration and immune reactivity of the cellular proteins were compared to the proteins from HBsAg 
particles (serotype ad) purified from human serum (obtained from Dr. W.H. Gerlich, Department of Medical 
Microbiology, University of Gottingen, Federal Republic of Germany). 

Three immunoreagents were used to analyse the proteins: 40 

- the monoclonal antibody (obtained from H. Thomas -Example 20) recognizing a S epitope 

- a polyvalent rabbit antiserum raised against the synthetic pre S2 peptide (described in Example 23) 

- monoclonal antibody MA18/7 specific for a pre S1 epitope as described by Heermann et al. (J. Virol.. 52, 
396-402, 1984). — 

The immunoblot showed, among the cellular proteins derived from pRIT12845 containing 10S44C ceils, the 45 
presence of two proteins reacting specifically with ail three immunoreagents described above. The molecular 
weight of these two proteins was estimated to be about 38,000 and 45,000 daltons. The molecular weight 
estimation was based on the migration of the proteins from serum derived HBsAg as defined by Heerman et al. 
(J. Virol., 52, 396-402, 1984). The 38,000 daiton pre S1-pre S2-S protein directed by pRIT12845 migrates slightly 
faster than the p39 pre S1-pre S2-S protein derived from HBsAg particles. The p39 protein from human serum 50 
particles of serotype ad presumably has a pre S1 region of 119 amino acids (Heermann et al., J. Virol. , 52, 
396-402, 1984). pRIT12845 codes for a pre S1 region of 108 amino acids. 

The above results show that yeast cells harboring pRIT12845 express two proteins erf about 38 and 45 kD 
carrying pre S1, pre S2 as well as S epitopes. 

Example 34. The 45 Kd preS1 -preS2-S protein species carries a N-glycosidically bound high-mannose type 
oligosaccharide chain 

Cells harboring pRIT12845 were disrupted with an equal weight of glass beads (diameter 0.45-0.50 mm) In 
an equal weight of 10 mM sodium phosphate buffer (pH 7.4) containing 29/o SDS and 8 mM EDTA by agitation 
with a Vortex mixer. 

After centrifugation, the supernatant was heated for 4 minutes at 100°. 

10 nl of the heated supernatant fluid was diluted with 40 nl 25 mM sodium citrate buffer (pH 5.0), and 2 jjJ 
EndoH (74 jtg/ml) (see, Example 27) was added. Samples with and without added EndoH were Incu- bated for 
2.5 hrs at 37°. 

The EndoH treated and untreated samples were analysed by SDS-poIyacrylamlde gel electrophoresis and 65 
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immunoblotting as described in Example 20. 

The blot was analysed with monoclonal antibody 6 (S specific) and MA18/7 (pre S1 specific) (see, Example 
33). ? 

The immunoblot showed both with monoclonal 6 and monoclonal MA18/7, the disappearance of the 45 kD 
5 band and the appearance of a 41 kD band upon Endo H treatment. The migration of the 38 kD band was not 
affected by the Endo H treatment. 

These results show that the 45 kD pre S1-pre S2-S protein species carries a N-glycosidically bound 
high-mannose type oligosaccharide chain. 

10 Example 35. The 38kD preS1 -preS2-S protein contains covalently bound myristic acid present in an amide 
linkage 

Cells from yeast strain 10S44C cir° harboring either pRIT12845, pRIT12660 or pRIT12377 were grown in YNB 
to an OD620nm of 0.4. 

Cells (20 ml culture) were labeled for 60 min with 1 mCi of 3H-Iabeled myristic acid (New England Nuclear) 
15 and then collected by centrifugation. Cells were washed with cold H2O. resuspended in 100 yd 5 mM tris-HCI 
(pH 7.4) containing 3 mM (DTT, 1°/o SDS and 1 mM PMSF, and broken with 100 mg glass beeds by six 
30-second spurts of vigorous mixing in a Vortex, cooling on ice between each mixing. Debris was removed by 
centrifugation for 1 minute in a Eppendorf 5414 centrifuge (Towier and Glaser, Proc. Natl. Acad. Sci., 83, 
2812-2816, 1986). 20 |il of the extract was treated for 3.5 hrs at room temperature with 7 of freshly prepared 4 
20 M hydroxylamine, 20 mM glycine, pH 10. ? 
Hydroxylamine treated and untreated samples were subjected to SDS polyacrylamide gel electrophoresis 
and fluorography as described in Example 26. The hydroxylamine untreated samples were also subjected to 
immunoblot analysis with the monoclonal antibody recognizing a S epitope, as described in Example 20. 
The immunoblot showed results identical to those described in Examples 20 and 33. - 
25 The fluorograph showed a labeled band of 38kD present only in the cellular extract from cells harboring 
pRIT12845, which was hydroxylamine-stable, indicating that the 3H-myristate label is present in an amide 
linkage. No labeled bands specific for the extracts from ceils harboring pRIT 12660 were detected. 

These results show that the 38kD pre S1-S2-S protein contains covalently bound myristic acid present in an 
amide linkage. In general, myristate becomes covalently bound to proteins via amide linkage to an 
30 amino-terminal glycine. This suggests that the initiator Met has been removed from the pre S1-pre S2-S 
protein and that the Qly residue at position 2 has become available for acylation with myristic acid. 

Example 36. Use of preS1 vectors for insertion of functional DNA coding sequences 
The starting material is DNA of plasmid pRIT 12793 described in Example 31 above. pRIT12793 contains the 

35 pre S1 -pre S2 coding sequence on a 495 bp Nco l -Xba l fragment. DNA of plasmid pRIT12793 is digested with 
Ncol restriction endonuclease, treated with T4 DNA polymerase to fill in the sticky ends and digested with Xba l 
endonuclease. The Ncol/T4 DNA polymerase /Xba l fragment is purified by polyacrylamide gel electrophoresis 
and electroelution and inserted on vector pUC12 previously digested with Smal and Xba l endonucleases.The 
cloning vector pUC12 Is commercially available from Amersham (Little Chaifont, Bucks, United Kingdom) and 

40 from Pharmacia (Piscataway, NJ). The plasmid pRITX is obtained as shown in Figure 5B. In the pre S1-pre S2 # 
region of plasmid pRITX, a single Nco l site overlaps the ATG start codon of the pre S1 region, and a Xba l site is 
present adjacent to and downstream of the TAA stop codon at the C-terminus of the pre-S2 coding 
sequences. A single BstX restriction site is present on the plasmid close to the N-terminal end of the pre S1 
coding region. ? 

45 It will be appreciated by one skilled in the art that the vector pRITX or similar derivative vectors containing 
the pre S1-S2 sequences may be used for introduction of further functional DNA coding sequences encoding 
peptides of interest to create in phase fusions within the pre S1 region by insertion at the Bst X site. 

Furthermore, functional DNA coding sequences may be introduced by digestion of pRITX and derivatives 
with combinations of BstX and BamHI or EcoR I or Pstl endonucleases and insertion of the functional DNA 

50 coding sequence between these sites, thus removing the greater part of the pre S1 coding region and the 
N-terminal part of the pre S2 sequence. Such fusions will be of the type pre S1-introduced functional DNA 
coding sequence pre S2. Once such fusions are made, they can be combined onto other plasmids containing 
the rest of the pre S1~pre S2 sequences and sequences necessary for maintenance and replication in E. coll 
and S. cerevisiae as exemplified in Example 16A. 

55 

Example 37. Fusion of peptide from proteins of HTLV-III virus to the preS2 region of preS2-S sequences to 
create novel hybrid particles 

The preS2 region of a HBV preS2-S coding sequence can also be used for the insertion or fusion of peptide 
sequences of other viral antigens including those of the causative agent of human AIDS syndrome, the 

60 retrovirus known as HIV, HTLV-III or LAV. 

The cloned genome of a HIV virus has been described by Ratner L et al. (Nature, 313, 277-284, 1985) and 
Shaw G. et al. (Science, 226, 1165-1171, 1984). From this genomic clone various subfragments can be 
obtained as functional DNA coding sequences which correspond to peptide regions of interest for fusion to 
the preS2-S sequence to form novel hybrid particles containing epitopes of the HIV virus proteins. Such hybrid , 

65 particles can serve as the basis of a human vaccine for protection against the infective agent HIV. 
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Among the peptide regions of interest are the following: 

a. C7 peptide. 

This region corresponds to a stretch of 45 amino acid residues Immediately adjacent to the processing site 
of the envelope precursor and was defined by Starcich B.R. at al. (Cell, 45. 637-648, 1986). 

b. Peptide 121. 

This region corresponds to a relatively conserved amino acid sequence present in several retroviral 
ransmembrane envelope proteins (see Clanciolo Q.J. et al. (Science, 230. 453^55, 1985). This sequence is 

riS„ n °r P ^^ Tt ! Pat ^!n e !L 8 »?f retrovlraUnduced immuTiHsuppresslon (Snyder man R. and 
uianciolo QJ., Immunol. Today, 5, 240, 1984). 

c. 'Pressman peptide*. 

A synthetic peptide corresponding to amino acid sequence 735 to 752 of the precursor envelope 
glycoprotein of HIV was used by Kennedy R.C. et al. (Science. 231 , 1556-1559, 1986) to Immunize rabbits and 15 
rabbft antiserum was used to Investigate the recognition of HIV envelope proteins by Induced antibodies The 
results suggest that this region could induce an Immune response directed against the native glycoprotein. 

on F pRrT10911 8 ° NA S6qUence corres P° nclin 9 to the C7 peptide from a HTLV-III isolate to the preS2-S region 

The starting material was plasmid pBH10-R2 containing the cloned genome of HIV Isolate BH10 This " 
plasmld was obtained from R.C. Gallo, National Institutes of Health. Bethesda, MD. A 3108 bp Sall-Xhol 
* 9m ~l 0f DNA of Viral orig,n which wa ^ excised from PBH10-R2 and cloned between the Sail and XhHl iitis 
of pUC18 to create plasmld pRrT12901. A restriction map of this Sall-Xhol fragment is sh5wn in FEure IB 
Plasmld pUCI8 is described by Norrander J. et al. (Gene, 26, 101-10671983) and Is commercially available from 25 
Pharmacia Inc.. Piscataway N.J. 

To create the fusion. DNA of pRIT12901 was digested with BgJII and Mboll endonucleases, and the 117 bp 
fragment Isolated by acrylamide gel electrophoresis and electroelutlon (iie, Figure IB). DNA of this fragment 
W mT™ 8 7 Mung Bean nuclea8e t0 remove sln^e stranded extensions and llgated with DNA of 
PRIT1091 1 (see. Example 5, above) which had been digested with BamH I endonuclease and treated with Mung 30 
Bean nuclease. From this ligation mixture a plasmid, pRIT12893. wasisolated In which the 117 bp Bglll-Mboll 
fragment from pRIT12901 has been Inserted on pRm0911. The nucleotide sequence across "the fGiton 
junction between the TDH3 promoter and the 117 bp HTLV-III BgJII-Mboll. mung bean nuclease treated 
fragment insert on pRIT12893 was determined and it was found thatlfTnucleotldes In excess had been 
removed from the BgJII end of the fragment. The sequence found was & 

5 ' ATGGAGGAGGAGATATGAGGGA 3 1 

where the first ATG codon underscored Is that of the TDH3 promoter region from pRIT1091 1 and the second is 40 

. ATa f*~? n ,ntemaI to the 07 pe P tide frasment. The second ATG condon overlaps a TGA termination 
codon. This termination codon is in the same reading frame as the first ATG codon. DNA sequence 
e !u"?!^, 0n ° f th8 3 end of the fusion re9lon on PRIT12893 showed the correct sequence for in phase fusion 
of the Mboll-mung bean nuclease treated end of the HTLV-III fragment to the preS2 sequence on pRIT1091 1 as 
shown in Figure 2B. ■ 45 

DNA of pRIT12893 was then digested with Hindlll endonuclease and the 3250 bp fragment isolated and 
*SS£2 1? fl e, , e ! e ^ r °P horesis electroelutlon. The 3250 bp Hindlll fragment was then Inserted at 
the Hindlll site of plasmld YE P 13 to create pRrT12894. DNA of the plasmldl^s then used, to transform cells of 
yeast strains DC5 and 10S44C to leucine Independence by the method of Ito et al. as cited In Example 10 

above. §q 

Transcription and translation in yeast cells of the C7-preS2-S fusion DNA on pRIT12894 will result in 
synthesis of a small 5 residue peptide commencing at the TDH3 ATG codon and terminating at the TGA codon 
underscored above and of the C7-preS2-S fusion peptide commencing at the second ATG codon Internal to 

2? l^V^T^ fJ" 8 ' 0 " COntaJnS 38 amlno *** residues from the 07 peptJde Place of the 45 residues 
from the Intact Bjlll-Mboll-mung bean nuclease treated C7 fragment. It will be appreciated that other plasmlds 55 
recovered from the same ligation mixture as pRIT12893 can be examined and sequenced to determine If thev 
carry a fusion corresponding to the complete fragment. 

B. Fusion of the DNA sequence corresponding to the Peptide 121 to the preS2-S region on pRIT10911 

To create the fusion. DNA of pRIT12901 (described in Part A. above) was digested with Hhal and Hindlll 60 
endonucleases to liberate the 230 bp fragment shown In Figure 1 B. This fragment was purified «ryS" 
gel electrophoresis and electroelutlon and treated with T4 DNA polymerase to fffl in single strand extensions 
The so treated fragment was llgated with DNA of P RrT10911 which had been digested with BamHI and Mung 
Vf^™ 888 ' ,. r0m " 9ation mlxture> a plasm,d was Identified as pRIT12897 on which the~Bobp fragment 
of HIV DNA has been Inserted in the correct reading frame of fusion to the preS2 sequence of pRI1091l as 65 
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shown in Figure 3B. DNA of plasmid pRIT12897 was then digested with Hind lll endonuclease, and the 3365 bp 
fragment containing the TDH3 promoter, HlV-preS2-S fusion and the ARG3 terminator was purified by agarose 
gel electrophoresis and electroelution. This 3365 bp fragment was iigated onto YEp13 digested with Hindlll 
endonuclease to form plasmid pRIT12898. DNA of this plasmid (pRIT 12898) was then used to transform cells 
5 of yeast strains DC5 and 10S44C to leucine independence by the method of Ito et ai. as cited in Example 10, 
above. 

C. Fusion of the DNA sequence corresponding to the "Dreesman°peptide to the preS2 region on pR(T1091 1 . 
A 60 bp fragment of synthetic DNA with the sequence shown below was synthesized by conventional means 
10 as two single strands which were then annealed together: 

5 • GATCTCGACAGGCCCGAAGGAATAGAAGAAGAAGGTGGAGAGAGA 3 1 
3 • AGCTGTCCGGGCTTCCTTATCTTCTTCTTCCACCTCTCTCT 5 1 
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5» GACAGAGACAGATCCCCG 3' 
3' CTGTCTCTGTCTAGGGGCCTAC 5» 



This fragment has 5' Bglll and 3' BamHI single strand extensions. About 200 ng of a double stranded 
25 fragment and 300 ng of DNA of plasmid pRIT10911 digested with BamHI were mixed and Iigated. From this 
ligation mixture, a plasmid, pRIT12899, was identified and recovered on which the 60 bp fragment has been 
inserted at the BamH I site of pRIT10911 so as to create an inframe fusion as shown in Figure 4B. DNA of 
pRIT12899 was digested with Hind lll endonuclease, and the 3200 bp Hind lll fragment was purified by agarose 
gel electrophoresis and electroelution. This Hind ill fragment was inserted at the Hind lll site of YEp13 to create 
plasmid pRIT12900. DNA of plasmid pRIT12900 was then used to transform ceils of yeast strains DCS and 
10S44C to leucine independence by the method of Ito et al. cited in Example 10, above. 



Example 38. Expression of fusion proteins from pRIR12894 and pRIT12898 

Yeast strains DCS or 10S44C harboring either YEp13, pRm2363, pRiT10912, pRIT12894 or pRIT12898 were 
separately grown in liquid medium lacking leucine (YNB + 80 jig/mi histidine or YNB), and cellular proteins 
were analyzed by the immunobfot procedure as described in Example 11 using as the first antibody a 
monoclonal antibody directed against a denaturation and reduction resistant epitope of human derived HBsAg 
(monoclonal 6 obtained from H. Thomas, Royal Free Hospital, London, England). pRIT10912 is described in 
Example 5 above. pRlT12894 and pRIT12898 are described in Example 37, above. pRIT12363 carries the S 
40 gene of HBV fused to the TDH3 promoter as the 2900 bp Hind lll fragment from pRIT12322 (Example 8) inserted 
on a yeast vector plasmid identical to pRIT12377 (Example 10, part B above) and produces HBsAg under the 
control of the TDH3 promoter. 

The immunoblot showed a protein band related to S protein of estimated molecular weight of about 32 kD 
among the total cellular proteins from yeast strains DC5 or 10S44C transformed with pRIT12894. A protein 
band related to S protein of estimated molecular weight of about 45 kD was present among the total cellular 
proteins from yeast strains DCS or 10S44C harboring pRIT12898, while a protein band related to S protein of 
estimated molecular weight of about 29 kD was present among the total cellular proteins from yeast strain DCS 
harboring pRIT10912. 

Crude cell extracts from yeast strain DC5 harboring either pRIT12363, pRITl6912, or pRIT12894 were 
50 prepared as described in Example 12. Protein concentration and AUSRIA activity were measured as described 
in Example 12. Immunoblot analysis of these extracts was performed as described above. 

CsCI equilibrium centrifugation of the crude extracts, (described in Example 12) and measurement of 
HBsAg related antigenicity by AUSRIA assay in the CsCI fractions, showed the antigen to band around 
rho = 1.2 g/cm 3 . This was confirmed by immunoblot analysis of the CsCI fractions. 

These results show that the 32 kD fusion protein produced in yeast strain DCS cells carrying pRlT12894 is 
assembled into particles like the 22 nm HBsAg particles derived from human serum. From the level of activity 
of those particles in the AUSRIA test, and the intensity of the reaction of the 32 kD protein in the immunoblot 
analysis, it can be concluded that the particles' S determinants are partially obstructed or deformed by the 
presence of C7 sequences. 

From these results it is concluded that yeast cells of strains DCS and 10S44C transformed with either 
PR1T12894 or pRIT12898 specifically synthesize a fusion protein of molecular weight of about 32KD or 45 kD 
respectively, both carrying a S epitope. 

It wiii be appreciated by one of skill in the art that hybrid particles carrying the HIV virus peptide epitopes can 
be extracted and purified by conventional means from cultures of yeast cells transformed with pRIT12894, 
pRIT12898 and pRIT12900. Such purified particles may then be formulated in the presence of adjuvants such 
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as A (OH) 3 or AIPO* as vaccines for human use. The preferred dose will be in the range of 1-1000 jig protein 
det f„ rmined 1 ^ conventional trials. It will also be understood that the invention as exemplified herein 
includes other peptide regions from HIV viral proteins which can be fused to the preS2-S region of HBV coding 
sequences to form hybrid particles presenting the HIV epitope of interest. a 



Claims 



1. A recombinant DNA molecule comprising a functional DNA coding sequence fused in phase to a 
portion of the Pre S2 region of a HBV Pre S2-S protein coding sequenca wherein the functional DNA 
coding sequence comprises the PreS2 coding sequence, PreS1 coding sequence or entire PreS1-PreS2 
coding sequence, the circumsporozoite protein coding sequence of Plasmodium , or a HIV coding 
sequence such as a HIV envelope gene sequence such as the HIV C7 protein coding region, the HIV 
Peptide 121 coding region orthe HIV Dreesman peptide coding region. 

2. A recombinant vector which comprises a recombinant DNA molecule operatively linked to a 
regulatory region, wherein the DNA molecule contains a functional DNA coding sequence fused in phase 
to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence wherein the regulatory 
region comprises the TDH3 promoter. y 

3. A recombinant vector which comprises a recom binant DNA molecule operatively linked to a 
regulatory region, wherein the DNA molecule contains a functional DNA coding sequence fused In phase 
to a portion of the pre S2 region of a HBV Pre S2-S protein coding sequence wherein the functional DNA 
coding sequence comprises the PreS2 coding sequence, PreS1 coding sequence or entire PreS1-PreS2 
coding sequence, circumsporozoite (CS) protein coding sequence of Plasmodium , or a HIV coding 
sequence such as a HIV envelope gene sequence such as the HIV C7 protein, the HIV Peptide 121 protein 
or the HIV Dreesman peptide. 

A Th «r 3Ctor ° f Claim 3 whioh com P rises a 1 92 bp Sau3A fragment coding for 16 tetrapeptide repeats 
of the CS protein of P-falclparum wherein such fragment Is one which could be derived from Sau3A 
digestion of a 1215 bp Stul - Rsal fragment of plasmid WR201 containing the P. falciparum CS protein 
coding sequence minus the first 52 bp of such sequence. 

5. The vector of Claim 4 which comprises additional Sau3A fragments of CS protein DNA. 

6. A recombinant vector which comprises a recombinant DNA molecule operatively linked to a 
regulatory region, wherein the DNA molecule contains a functional DNA coding sequence fused In phase 
to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence wherein the functional DNA 
coding sequence comprises all or part of the Pre S2 protein coding sequence 

7. The vector of Claim 6 wherein the functional DNA coding sequence comprises the following amino 
acid coding sequence : 

PRE-S2 REGION 



-55 

ATG CAG TGG AAT TCC 
Met Gin Trp Asn Ser 

GCT CTG CAG GAT CCC 
Ala Leu Gin Asp Pro 

TAT TTT CCT GCT GGT 
Tyr Phe Pro Ala Gly 



ACT GCC TTC CAC CAA 
Thr Ala Phe His Gin 

AGA GTC AGG GGT CTG 
Arg Val Arg Gly Leu 

GGC TCC AGT TCA GGA 
Gly Ser Set Ser Gly *" 
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ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 
Thr Val Asn Pro Ala Pro Asn lie Ala Ser 

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG 
His He Ser Ser Ser Ser Ala Arg Thr Gly 

GAC CCT GTG ACG AAC 
Asp Pro Val Thr Asn. 



8. The vector of Claim 6 wherein the functional DNA coding sequence comprises all or part of the Pre 
S1-Pre S2 protein coding sequence. 

9. The vector of Claim 8 wherein the functional DNA coding sequence comprises the following amino 
acid coding sequence: 

PRE-S1 REGION 
-163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG , 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro lie Lys Asp His Trp 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 
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TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

GGT ATT TTG GGG TGG AGCCCT CAG GCT CAG 
Gly He Leu Gly Trp Set Pro Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly He Leu Thr Thr Val Sec Thr He Pro 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro lie Ser Pro Pro Leu 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 

PRE-S2 REGION 

-S5 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 
Met Gin Trp Asn Ser Thr Ala Phe His Gin 

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 
Ala Leu Gin Asp Pro Arg Val Arg Gly Leu 

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA 
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly 

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 
Thr Val Asn Pro Ala Pro Asn He Ala Ser 

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG 
His He Ser Ser Ser Ser Ala Arg Thr Gly 
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GAC CCT GTG ACG AAC 
Asp Pro Val Thr Asn. 



10. A eukaryotic host cell transformed with a recombinant vector, wherein said vector is according to 
Claim 5. 

1 1 . The host of Claim 10 which is a yeast ceil. 

10 12. A method of preparing a hybrid particle containing HBsAg protein which contains and/or presents a 

peptide encoded by a functional DNA coding sequence which comprises culturing a eukaryotic host cell 
transformed with a vector in appropriate culture media and isolating the particle from a cell lysate or 
extract of such host's culture, wherein said vector is according to Claim 5. 
13. A particle prepared by the method of Claim 12. 

15 14. A hybrid immunogenic particle comprising HBsAg protein wherein said particle contains and/or 

presents a peptide encoded by a functional DNA coding sequence. 

15. A vaccine comprising an immunoprotective amount of a particle according to Claim 14. 

16. A miceile comprising the particle according to Claim 14 and polysorbate. 

17. A vaccine comprising an immunoprotective amount of a micelle according to Claim 16. 
20 18. A HBV Pre S1 protein coding region which codes for the following amino sequence: 
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-163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro lie Lys Asp His Trp 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly lie Leu Gly Trp Ser Pro Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly lie Leu Thr Thr Val Ser Thr lie Pco 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Sec Giy 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro lie Ser Pro Pro Leu 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 



19. A recombinant DNA vector comprising the Pre S1 protein coding region of Claim 18 operatlveiy 
linked to a regulatory region. 



55 



0 278 940 



20. A transformed host cell comprising the Pre S1 protein coding region of Claim 18. 

21. Protein encoded by the Pre S1 protein coding region of Claim 18. 

22. A vaccine comprising an immunoprotective amount of the protein encoded by the Pre S1 protein 
coding region of Claim 18. 

23. A HBV Pre S2 protein coding region or a HBV Pre S2-S protein coding region wherein the Pre S2 
portion thereof codes for the following amino acid sequence : 

Thr 

-55 -50 or , 

Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Glri-Ala-Leu-Gln-Asp- 



-40 -30 
Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser- 
Ser- 

-20 

Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-Ser- 



-10 

Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn 



24. A recombinant DNA vector comprising the Pre S2 or the Pre S2-S protein coding region of Ciaim 23 
operatively linked to a regulatory region. 

25. A transformed host ceil comprising the Pre S2 of the Pre S2-S protein coding region of Claim 23. 

26. Protein encoded by the Pre S2 or the Pre S2-S protein coding region of Claim 23. 

27. A vaccine comprising an immunoprotective amount of protein encoded by the Pre S2 or the Pre S2-S 
protein coding region of Claim 23. 

28. A particle comprising protein encoded by the Pre S2-S protein coding region of Claim 23. 

29. A vaccine comprising an immunoprotective amount of a particle encoded by the Pre S2-S protein 
coding region of Claim 23. 

30. A micelle containing the particle of Claim 28 and polysorbate. 

31 . A vaccine comprising an immunoprotective amount of the micelle of Claim 30. 

32. A recombinant DNA vector comprising a Pre S1-Pre S2-S protein coding sequence operatively linked 
to a regulatory region wherein the Pre S1 - Pre S2 portion of such sequences codes for the following 
amino acid coding sequence: 

PRE-S1 REGION 
-163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 
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GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asa Ser Asn Asn Pro Asp 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro He Lys Asp His Trp 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly He Leu Gly Trp Ser Pro Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly He Leu Thr Thr Val Ser Thr He Pro 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 

PRE-S2 REGION 

-55 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 
Met Gin Trp Asn Ser Thr Ala Phe His Gin ' 
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GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 
Ala Leu Gin Asp'Pro Arg Val Arg Gly Leu 



TAT TTT CCT GCT GGT 
Tyr Phe Pro Ala Gly 

ACA GTA AAC CCT GCT 
Thr Val Asn Pro Ala 

CAC ATA TCG TCA AGC 
His lie Ser Ser Ser 

GAC CCT GTG ACG AAC 
Asp Pro Val Thr Asn 



GGC TCC AGT TCA GGA 
Gly Ser Ser Ser Gly 

CCG AAT ATT GCC TCT 
Pro Asn lie Ala Ser 

TCC GCG AGG ACT GGG 
Ser Ala Arg Thr Gly 



33. A yeast host cell transformed with the vector of Claim 32. 

34. Protein prepared by the PreS1-PreS2-S sequence of the vector of Claim 32. 

35. A vaccine comprising an immunoprotective amount of the protein of Claim 34. 

36. A particle comprising protein encoded by a PreS1-PreS2-S protein coding region. 

37. A vaccine comprising an immunoprotective amount of the particle of Claim 36. 

38. A micelle comprising the particle of Claim 36 and polysorbate. 

39. A vaccine comprising an immunoprotective amount of the micelle of Claim 38. 

40. A Pre Sl-Pre S2 protein coding sequence which comprises the following amino acid sequence: 
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Pre SI Region 

-163 

AT6 GGG ACG 

Met Gly Thr 

GGA TTC TTT 

Gly Phe Phe 

GCA TTC GGA 

Ala Phe Gly 

TGG GAC TTC 

Trp Asp Phe 

CCA GCA GCC 

Pro Ala Ala 

TTC GGG CCA 

Phe Gly Pro 

GGT ATT TTG 

Gly lie Leu 

GGC ATA TTG 

Gly lie Leu 

CCT CCT GCC 

Pro Pro Ala 

AGG CAG CCT 

Arg Gin 'Pro 



AAT CTT TCT 

Asn Leu Ser 

CCC GAT CAT 

Pro Asp His 

GCC AAC TCA 

Ala Asn Ser 

AAC CCC ATC 

Asn Pro lie 

AAC CAG GTA 

Asn Gin Val 

GGG CTC ACC 

Gly leu Thr 

GGG TGG AGC 

Gly Trp Sec 

ACC ACA GTG 

Thr Thr Val 

TCC ACC AAT 

Ser Thr Asn 

ACT CCC ATC 

Thr Pro He 



GTT CCC AAC 

Val Pro Asn 

CAG TTG GAC 

Gin Leu Asp 

AAC AAT CCA 

Asn Asn Pro 

AAG GAC CAC 

Lys Asp His 

GGA GTG GGA 

Gly Val Gly 

CCT CCA CAC 

Pro Pro His 

CCT CAG GCT 

Pro Gin Ala 

TCA ACA ATT 

Sec The lie 

CGG CAG TCA 

Arg Gin Ser 

TCT CCA CCT 

Ser Pco Pro 



CCT CTG 
Pro Leu 

CCT 
Pro 

GAT 
Asp 

TGG 
Trp 

GCA 
Ala 

i 

GGC 
Gly 

CAG 
Gin 

CCT 
Pro 

GGA 
Gly 

CTA 
Leu 
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AGA GAC AGT CAT CCT CAG GCC 

Axg Asp Ser His Pro Gin Ala 

Pre S2 Region 
-55 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 

Met Gin Trp Asn Ser Thr Ala Phe His Gin 

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 

Ala Leu Gin Asp Pro Arg Val Arg Gly Leu 

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA 

Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly 

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 

Thr Val Asn Pro Ala Pro Asn lie Ala Ser 

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG 

His lie Ser Ser Ser Ser Ala Arg Thr Gly 

GAC CCT GTG ACG AAC 

Asp Pro Val Thr Asn 



40 41. A recombinant DNA vector comprising the Pre S1-Pre S2 protein coding sequence of Claim 40 

operatively linked to a regulatory region. 

42. A yeast host cell transformed with the vector of Claim 41 . 

43. Protein encoded by the Pre S1-Pre S2 protein coding region of Claim 40. 

44. A vaccine comprising an immunoprotective amount of protein encoded by the Pre S1-Pre S2 protein 
45 coding region of Claim 40. 

45. A recombinant DNA vector comprising a Pre S1 functional DNA coding sequence - Pre S2-S protein 
coding sequence operatively linked to an expression control sequence. 

46. A yeast host ceil transformed with the vector of Claim 45. 

47. Protein expressed by the host of Claim 46. 

50 48. A vaccine comprising an immunoprotective amount of the protein of Claim 47. 

Claims for the following contracting states : Es, Gr 

1 . A method of preparing a transformed eukaryotic host cell which comprises transforming a eukaryotic 
host cell with a vector which comprises a recombinant DNA molecule operatively linked to a regulatory 

55 region containing a functional DNA coding sequence fused, in phase, to a portion of the PreS2 region of a 

HBV PreS2-S protein sequence wherein the functional DNA coding sequence also encodes the 
circumsporozoite (CS) protein of Plasmodium , or a HIV envelope gene sequence such as the HIVC7 
protein, the HIV peptide 121 protein or the HIVDreesman peptide.. 

2. The method of Claim 1 wherein the host is a yeast cell. 

60 3. The method of Claim 2 wherein the host belongs to the genus Saccharomyces . 

4. The method of Claim 3 wherein the host belongs to the species Saccharomyces cerevisiae or 
Saccharomyces carisbergensls . 

5. The method of Claim 4 wherein the host is strain DCS, DCS cir° , 1 0S44C or 1 0S44C or 1 0S44C cir° . 

6. A method of preparing a hybrid particle containing HBsAg protein which contains and/or presents a 
65 peptide encoded by a functional DNA coding sequence which comprises culturing a eukaryotic host cell 
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transformed with a vector in appropriate culture media and isolating the particle from a cell tysate or 
4 extract of such host's culture, wherfch said vector Is a vector which comprises a recombinant DNA 

molecule operattvely linked to a regulfitfory region containing afunctional DNA coding sequence fused, In 
phase, to a portion of the PreS2 region of a HBV Pre S2-S protein sequence wherein the functional DNA 
coding sequence also encodes the circumsporozoite (CS) protein of Plasmodium , or a HIV envelope s 
gene sequence such as the HIVC7 protein, the HIV peptide 121 protein or the HIVDreesman peptide. 

7. A process of preparing a transformed host cell which comprises transforming a host cell with the 
HBV Pre S1 protein coding region which codes for the following amino sequence: 

-163 10 
ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 

15 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

20 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT i 
• Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

25 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro lie Lys Asp His Trp 

30 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

35 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

# 

40 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly He Leu Gly Trp Ser Pro Gin Ala Gin 

45 
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GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly He Leu Thr Thr Val Ser Thr He Pro 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 



8. The process of Claim 7 wherein the region is comprised by a recombinant DNA vector. 

9. The process of Claim 7 wherein the host is a yeast cell. 

10. The process of Claim 9 wherein the host belongs to the genus Saccharomyces . 

11. A process for preparing the protein encoded by the Pre S1 protein coding region of Claim 7 which 
comprises cuituring a host cell transformed with the region in appropriate culture media and isolating the 
protein from a cell lysate or extract of such host's culture. 

12. A process of preparing a transformed host cell which comprises transforming a host cell with a HBV 
PreS2 protein coding region or a HBV PreS2-S protein coding region wherein the PreS2 portion thereof 
codes for the following amino acid reference: 

Thr 

-55 -50 or 

Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-Leu-Gln-Asp- 



-40 -30 
Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser- 
Ser- 

-20 

Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-Ser- 



-10 

Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn 



13. The process of Claim 12 wherein the region is comprised by a recombinant DNA vector. 

14. A process for preparing protein encoded by the Pre S2 or the Pre S2-S protein coding region of 
Claim 12 which comprises cuituring a host cell transformed with the region in appropriate culture media 
and isolating the protein from a cell lysate or extract of such host's culture. 

15. A process for preparing a particle encoded by the Pre S2-S protein coding region of Claim 12 which 
comprises cuituring a host cell transformed with the region in appropriate culture media and isolating the 
protein from a cell iysate or extract of such host's culture. 

16. A method of preparing a transformed yeast host cell which comprises transforming a yeast host cell 
with a recombinant DNA vector comprising a PreS1-PreS2-S protein coding sequence operatively linked 
to a regulatory region. 

17. A method of preparing protein encoded by the Pre S1-Pre S2-S protein coding sequence which 
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comprises culturfng a yeast host cell transformed with recombinant DNA vector comprising a 
PreS1-PreS2-S protein coding Sequence operattvely linked to a regulatory region in appropriate culture 
media and isolating the protein from a ceil lysate or extract of such host's culture. 

18. A method of preparing a particle containing polypeptide- encoded by the Pre S1-Pre S2-S protein 
coding sequence which comprises culturing a yeast host cell transformed with a recombinant DNA vector 5 
comprising a PreS1-PreS2-S prbteth coding sequence operativefy linked to a regulatory region in 
appropriate culture media and Isolating the particle from a cell lysate or extract of such host's culture. 

19. A method of preparing a transformed yeast host cell which comprises transforming a yeast host ceil 
with a recombinant DNA vector comprising th6 following PreS1-S2-proteln coding sequence operatively 
linked to a regulatory region. 10 

Pre SI Region 
-163 

ATG GGG ACG AAT CTT T?CT GTT CCC AAC CCT CTG 15 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 20 
Gly Ptie Phe Pro Asp His Gin Leu Asp Pro 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 25 
Ala Phe Gly Ala Asia Ser Asn Asn Pro Asp 

30 



35 



45 



BO 



55 



60 
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TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 

Trp Asp Phe Asn Pro He Lys Asp His Trp 

CCA GCA GCC AAC CAG GTA GGA. GTG GGA GCA 

Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA- CAC GGC 

Phe Gly Pro Gly leu Thr Pro Pro His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 

Gly He Leu Gly Trp Ser Pro Gin Ala Gin 



GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 

Gly He Leu Thr Thr Val Ser Thr He Pro 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 

Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 

Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

- I 

AGA GAC AGT CAT CCT CAG GCC 

Arg Asp Ser His Pro Gin Ala 



Pre S2 Region 
-55 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 
Met Gin Trp Asn Ser Thr Ala Phe His Gin 



GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 

Ala Leu Gin Asp Pro Arg Val Acg Gly Leu 

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA 

Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly 
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ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 
Thr Val Asn Pro Ala Pro Asn He Ala Ser 

5 

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG 
His He Ser Ser Ser Ser Ala Arg Thr Gly 

10 

GAC CCT GTG ACG AAC 
Asp Pro Val Thr Asn 

15 

20. A process for preparing protein encoded by the Pre S1-Pre S2 protein coding region of Claim 1 9 
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Asn 


GCA 
Ala 


550 
AAC 
Asp 


CCA 
Pro 


AAT 
Asn 


GCA 
Ala 


AAC 
Asn 


CCA 
Pro 


AAT 
Asn 


GCA 
Ala 


AAC 
Asn 


CCA 
Pro 


AAT 
Asn 


GCA 
Ala 


AAC 
Asn 


CCA 
Pro 


AAT 
Asn 


GCA 


AAC 
Asn 


600 

CCA 

Pro 


AAT 
Asn 


GCA 
Ala 


AAC 
Asn 


CCA 
Pro 


AAT 
Asn 


GCA 
Ala 


AAC 
Asn 


CCC 
Pro 


AAT 
Asn 


GCA 
Ala 


AAT 
Asn 


CCT 
Pro 


AAT 
Asn 


GCA 
Ala 


AAT 
Asn 


6£ 
CCT 
Pro 


AAT 
Asn 


GCA 
Ala 


AAC 
Asn 


CCA 
Pro 


AAT 
Asn 


GCA 
Ala 


AAT 
Asn 


CCT 
Pro 


AAT 

Asn, 


GCA 
Ala 


AAC 
Asn 


CCA 
Pro 


AAT 
Asn 


GCA 
Ala 


AAC 
Asn 


CCA 
Pro 


700 
AAC GTA 
Asn RaT 


GAT 
Aspl 


CCT 
Pro 


AAT 
Asn 


GCA 
Ala 


AAT 
Asn 


CCA 
Pro 


AAT 
Asn 


GCA 
Ala 


AAC 
Asn 



Fig. 3A b 



0278S4G 



750 



OOA 


A AX 

AA 1 


GCA 


AAC 


ULrA 


AAU 


oUA 


A AO 

AAU 


UUU 


rrO 


Asn 


Ala 


Asn 


rrQ 


A f\ m 

Asn 


Aia 


A am 

Asn 


rro 


AAT 


GCA 


AAT 


CCT 


AAT 


GCA 


AAC 


CCC 


AAT 


Acn 
nol i 


Ala 
Mid 


Asn 


Pro 


Aon 
Moll 


Ala 
Mia 


Msn 


rrO 


asm 










ouu 










GCA 


AAT 


CCT 


AAT 


GCA 


A A T 

AAT 


CCT 


A A *^ 

AAT 


GCC 


Ala 


Asn 


Pro 


Asn 


A 1 

Ala 


Asn 


Pro 


Asn 


A 1 — 

Ala 


AAT 


CCA 


AAT 


GCA 


AAT 


CCA 


AAT 


GCA 


AAC 


ASM 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


A U 

Ala 


Asn 


















CCA 


AAC 


GCA 


AAC 


CCC 


A A 

AAT 


GCA 


A A 

AAT 


CCT 


Pro. 


Asn 


Ala 


Asn 


Pro 


Asn 


A 1 

Ala 


Asn 


Pro 


AAT 


GCC 


AAT 


CCA 


a a t~ 

AAT 


GCA 


AAT 


CCA 


AAT 


Asn 


Ala 


Asn 


Pro 


Asn 


A 1 

Ala 


Asn 


Pro 


Asn 




yuu 














GCA 


AAC 


CCA 


AAT 


GCA 


AAC 


CCA 


AAT 


GCA 


Aia 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


Ala 


A A /"N 

AAC 


CCC 


AAT 


GCA 


AAT 


CCT 


AAT 


AAA 


AAC 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


Lys 


Asn 
















AAT 


CAA 


GGT 


AAT 


GGA 


CAA 


GGT 


CAC 


AAT 


Asn 


Gin 


Gly 


Asn 


Gly 


Gin 


Gly 


His 


Asn 










1000 


ATG 


CCA 


AAT 


GAC 


CCA 


AAC 


CGA 


AAT 


GTA 


Met 


Pro 


Asn 


Asp 


Pro 


Asn 


Arg 


Asn 


Val 


GAT 


GAA 


AAT 


GCT 


AAT 


GCC 


AAC 


AAT 


GCT 


Asp 


Glu 


Asn 


Ala 


Asn 


Ala 


Asn 


Asn 


Ala 












1050 




GTA 


AAA 


AAT 


AAT 


AAT 


AAC 


GAA 


GAA 


CCA 


Val 


Lys 


Asn 


Asn 


Asn 


Asn 


Glu 


Glu 


Pro 


AGT 


GAT 


AAG 


CAC 


ATA 


GAA 


CAA 


TAT 


TTA 


Ser 


Asp 


Lys 


His 


lie 


Glu 


Gin 


Tyr 


Leu 










Pig. 


3A c 









0278940 



1100 

AAG AAA ATA AAA AAT TCT ATT TCA ACT 

Lys Lys lie Lys Asn Ser He Ser Thr 

REGION II 

GAA TGG TCC CCA TGT AGT GTA ACT TGT 

Glu Trp Ser Pro Cys Ser Val Thr Cys 



GGA 


AAT 


GGT 


AAT 


CAA 


GTT 


AGA 


ATA 


AAG 


Gly 


Asn 


Gly 


He 


Gl 


I 


Arg 


lie 


Lys 


CCT 


GGC 


TCT 


GCT 


AAT 


AAA 


CCT 


AAA 


GAC 


Pro 


Gly 


Ser Ala 
1200 


Asn 


Lys 


Pro 


Lys 


Asp 


GAA 


TTA 


GAT 


TAT 


GAA 


AAT 


GAT 


ATT 


GAA 


Glu 


Leu 


Asp 


Tyr 


Glu 


Asn 


Asp 


He 


Glu 


AAA 


AAA 


ATT 


TGT 


AAA 


ATG 


GAA 


AAA 


TGT 


Lys Lys 
1250 


lie 


Cys 


Lys 


Met 


Glu 


Lys 


Cys 


TCC 


AGT 


GTG 


TTT 


AAT 


GTC 


GTA 


AAT 


AGT 


Ser 


Ser 


Val 


Phe 


Asn 


Val 


Val 


Asn 


Ser 


TCA 
















1300 


ATA 


GGA 


TTA 


ATA 


ATG 


GTA 


TTA 


TCC 


Ser 


lie 


Gly 


Leu 


lie 


Met 


Val 


Leu 


Ser 


TTC 


TTG 


TTC 


CTT 


AAT 


TAG 


ATAAAGAACACAT 


Phe 


Leu 


Phe 


Leu 


Asn 
















1350 










CTTAGI I I GAGTTGTACAATATTT 


ATAAAAATATATACTACTTT 



1400 

TTTTCTT AATTTTC ATTTTTCTTT ATATTTTC C G AG G G AATTT AT 

1450 

TTTTTTGTG A ATATTTAATT ACGTTTG CG ATTAATTGTAG AAAT 

1500 

ATATATGTATATACTATATTTATAGAATGTGTTATTCTCAAAAA 



Fig. 3A d 



0278940 



1550 

CAACAACAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAGG 

ATTAAAAGTAAAATAGTTATAAATATTTTCAAAAATATTTATAA 
1600 

CACMAAAATACTTCGAAGTTCATTTMCATTTTTGTTTATTT 
1650 

ATTTATTTATATATTTCA I I I I I ACGTATTTATATTATAAAATG 

1700 

GTGTATCTTAAAAATAGTGAACTATATATATAAAATATTAATT 

1750 

TAAAAAAATTATAACTTTC I I I I I ATTTTCTAAAATAACTTAAA 

1800 

AATTATATGTTTAAGAAAGGGGTAAATTATAATATTTGTATAA 

1850 

ATATATAAACATAGATATATTAAATAAAATAACAAATGTACTA 

TATTTGTGCATAAGACGTATACGCTTTATATAATACAACAATA 

TTAATTGTAATAATATTTGTGGTAGTGTGAACACTAAAATTGA 
1950 

TAATAATGATTATAATACAGAAGAAATAAAAAATGAATCCAAT 

2000 

ATAGGATTTACAACAAATATTCATGAAGCAAAAATAATTCAAG 

2050 

AAAAGACATATGGATTAATAATAAACGATAAAATAAAGAAAG 

Fig. 3A e 



0278S40 



2100 

AAGAATATGATGATTGTAATAATAATAATAATAATAATATTAT 

2150 

AATACAGATAAGAGAAGTTGGACTTAATTATTTTGGAGATAC 



TCTCGATGAATCGAATCCATGTAATGATCTTACAGGTATTAA 
2200 

TATATGGGAAAGTTGTCTTGTGGCTAGTCGATGGTTTAGCG 

2250 

ATTTATCTTTACAG AA I I I I I I I I CGAATAAAAATATTTTAGA 

2300 

AATTGGTGCTG GCAGTGGTTTG GCTAGTATAATAATATTTAT 



ATATTCTAATATTTACAA 



Fig. 3A f 



0278940 

Sequence of the Pre si- Pre S2 region of pRIT 12792 



PRE-S1 REGION 

Ncol 
-163 



COATG 


GGG 


ACG 


Met 


Glv 


Thr 


CCT 


CTG 


GGA 


Pro 


Leu 


Glv 


TTG 


GAC 


CCT 


Leu 


Asd 


Pro 


AAC 


AAT 


CCA 


Asn 


Asn 


Pro 


ATC 


AAG 


GAC 


Ho 


LyS 


Asp 


CAG 


GTA 


GGA 




val 


Gly 


GGG 


CTC 


ACC 


Gly 


Leu 


Thr 


TTG 


GGG 


TGG 


Leu 


Gly 


Trp 


ATA 


TTG 


ACC 


He 


Leu 


Thr 


CCT 


CCT 


GCC 


Pro 


Pro 


Ala 


GGA 


AGG 


CAG 


Gly 


Arg 


Gin 


CCT 


CTA 


AGA 


Pro 


Leu 


Arg 



AAT 
Asn 


CTT 
Leu 


TCT 
Ser 


TTC 
Phe 


1 1 1 
Phe 


CCC 
Pro 


GCA 
Ala 


TTC 
Phe 


GGA 
Gly 


GAT 
Asp 


TGG 
Trp 


GAC 
Asp 


CAC 
His 


TGG 
Trp 


CCA 
Pro 


GTG 
Val 


GGA 
Gly 


GCA 
Ala 


CCT 
Pro 


CCA 
Pro 


CAC 
His 


AGC 
Ser 


CCT 
Pro 


CAG 
Gin 


ACA 
Thr 


GTG 
Val 


TCA 
Ser 


TCC 
Ser 


ACC 
Thr 


AAT. 
Asn 


CCT 
Pro 


ACT 
Thr 


CCC 
Pro 


GAC 
Asp 


AGT 
Ser 


CAT 
His 


Fig 


. 4A a 





GTT 

Val 


CCC 
Pro 


AAC 
Asn 


GAT 
Asp 


CAT 
His 


CAG 
Gin 


GCC 
Ala 


AAC 
Asn 


TCA 
Ser 


TTC 
Phe 


AAC 
Asn 


CCC 
Pro 


GCA 
Ala 


GCC 
Ala 


AAC 
Asn 


TTC 
Phe 


GGG 
Gly 


CCA 
Pro 


GGC 
Gly 


GGT 
Gly 


ATT 
lie 


GCT 
Ala 


CAG 
Gin 


GGC 
Gly 


ACA 
Thr 


ATT 
He 


CCT 
Pro 


CGG 
Arg 


CAG 
Gin 


TCA 
Ser 


ATC 
He 


TCT 
Ser 


CCA 
Pro 


CCT 
Pro 


CAG 
Gin 


GCC 
Ala 



0*27 89^0 



PRE-S2 REGION 



rr 

-55 

ATG 

Met 


CAG 
Gin 


TGG 
Trp 


AAT 
Asn 


TCC 
Ser 


ACT 
Thr 


GCC 
Ala 


TTC 
Phe 


CAC 
His 


CAA 
Gin 


GCT 
Ala 


CTG 
Leu 


CAG 
Gin 


GAT 
Asp 


CCC 
Pro 


AGA 
Arg 


GTC 
Val 


AGG 
Arg 


GGT 
Gly 


CTG 
Leu 


TAT 
Ty r 


TTT 
Phe 


CCT 
Pro 


GCT 
Ala 


GGT 
Gly 


GGC 
Gly 


TCC 
Ser 


AGT 
Ser 


TCA 
Ser 


GGA 
Gly 


ACA 
Thr 


GTA 
Val 


AAC 
Asn 


CCT 
Pro 


GCT 
Ala 


CCG 
Pro 


AAT 
Asn 


ATT 
He 


GCC 
Ala 


TCT 
Ser 


CAC 
His 


ATT 
lie 


TCG 
Ser 


TCA 
Ser 


AGC 
Ser 


TCC 
Ser 


GCG 
Ala 


AGG 
Arg 

Xba 1 


ACT 
Thr 


GGG 
Gly 


GAC 
Asp 


CCT 
Pro 


GTG 
Val 


ACG 
Thr 



AA TAATAATCTAGA 



Pig. 4A b 




Figure 1B 



0278S4G 



5' 



Bglll 



.Mboll 3' 



117 bp fragment fror 
pRIT12901 



Mung bean nuclease 



TTC 
AAG 



t 



GTGC 
CACG 



t 

EC7-preS2-S 
ARG 
AGG 

TACCTCCTCCTCTATACTCC 



.pRIT10911/BamHI/ 
mung bean nuclease 



VAL PRO 

GTGC CC 
-CACG GG 




preS2-S 



pTDH3 



Figure 2B 



Hha I - Hind III fragment (230bp) op pRIT12901 




Figure 3B 



0278940 



p. Dreesman (Synthetic) 




Figure 4B 



0278940 



PRIT12793 




Ncol 

T4 polymerase 
Xbai 

purify 502 bp fragment 




Figure 5B 



